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(57) Abstract 

The present invention provides a system for producing and modifying 
natural products produced by a large group of bacteria for the purpose of drug 
discovery, development and production. The method of the invention transfers 
the ability to produce a secondary metabolite from an actinomycete that is the 
original producer of the natural product, to a different production host that has 
desirable characteristics. The system involves the construction of a segment 
of the chromosome of the original producer in an artificial chromosome that 
can be stably maintained in a suitable production host. The present invention 
relates to recombinant DNA vectors useful for shuttling the genetic information 
necessary to synthesize a given natural product between a donor organism and a 
production host. The methods of the invention are useful in improving the yield, 
the purification process and for structural modification of a natural product. 
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METHODS FOR TRANSFERRING THA CAPABILITY TO PRODUCE A NATURAL 
PRODUCT INTO A SUITABLE PRODUCTION HOST. 

5 1. FIELD OF THE INVENTION 

The present invention relates to a novel approach for drug 
discovery. More particularly, the invention relates to a 
system for improving the process of lead optimization and 
development of compounds, when these compounds are natural 

10 products produced by microorganisms belonging to the order 
Actinomycetales or chemical derivatives of these compounds. 
The invention relates to a system for transferring the 
capability to produce a natural product from a 
microorganism belonging to the order Actinomycetales into a 

15 defined host, where said natural product can be optimally 
produced and its biosynthetic pathway suitably modified. 

2. BACKGROUND ART 

Natural products are complex molecules with important uses 
20 in medicine. Examples include: antibacterial agents, such 
as erythromycin, teicoplanin, tetacycline; antitumor 
compounds, such as dauxorubicin; antihelmintic compounds, 
such as avermectin; immunosuppressive agents, such as 
cyclosporin and FK506; antifungal compounds, such as 
25 amphotericin and nystatin; etc. Natural products are 
produced as secondary metabolites by a wide range of living 
organisms. Although many secondary metabolites have been 
identified, there remains the need . to obtain novel 
structures with new activities or enhanced properties. 
30 Current methods of obtaining such molecules include 
screening of natural isolates and chemical modification of 
existing ones. Random screening of natural products from 
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disparate sources has resulted in the discovery of many 
important drugs and is still employed for seeking for novel 
activities. This process, which consists in exposing a 
miniaturized biological system to tens or hundreds of 
thousands of different compounds, in order 10 find those 
few that exhibit a desired property, is designated high 
throughput screening, or HTS. 

One of the used sources widely in HTS is a collection 
of natural products produced by small-scale fermentation of 
newly isolated microorganisms. A natural product may have 
one or more potential therapeutic properties, including but 
not limited to antibacterial, antifungal, antiviral, 
antitumor, iramunomodulating or other pharmacological 
properties. Natural products have long constituted a 
source of interesting, structurally original and 
"imaginative" molecules endowed with potent biological 
activities. In addition, recent observations indicate that 
only a small fraction of the microbial flora present in 
environmental samples, ranging from 0.01 to 1% according to 
the estimates, is related to known species. Microorganisms 
belonging to the order Act inomycet ales represent thus far 
the group of producers unsurpassed for chemical and 
biological diversity. However, more than 15,000 natural 
products produced by microorganisms have been described, 
and the chances of finding new structures are relatively 
small, unless efforts are directed towards those classes of 
microorganisms that have been little exploited in the past. 
Poorly characterized actinomycete genera can thus 
constitute a useful source of novel structures. With proper, 
methodologies, unusual genera can be isolated from 
environmental samples and some of these isolates will 
produce interesting activities. These could either 
represent completely new entities, or known molecules 
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acting on a novel target or in a previously unreported way. 
Many of these products will have original structures and 
potent biological activities. However, newly discovered 
secondary metabolites will be produced for the most part by 
microorganisms which have been isolated for the 
characteristic of being unusual and selected for their 
ability to produce a. given bioactivity. Consequently, 
little will be known about the best conditions for growth, 
productivity and storage. Often the microorganism does not 
produce a single bioactive compound, and other, unrelated 
activities must be completely removed for a meaningful 
evaluation of the properties of the lead compound. 
Furthermore, rarely is a secondary metabolite produced as a 
single, bioactive molecule, but is often present as a 
"complex" of several, closely related compounds, only some 
of which may possess the desired biological or chemical 
properties. Therefore, physiological conditions, such as 
nutrient and cofactor supply, that allow obtaining a 
"controlled" complex need to be established empirically by a 
trial and error approach. Finally, the natural product may 
need structural modification, and this can be achieved only 
by chemical means. In essence, the scarce knowledge 
available on the physiology and genetics of the producing 
strain will severely hamper the lead optimization and 
development processes . 

Chemical modification of preexisting natural products 
has been successfully employed to generate derivatives of 
natural products, but it still suffers from practical 
limitations to the type of compounds obtainable. Many 
natural products are often structurally complex molecules, 
with relatively large molecular weights. Due to their 
structural complexity, total synthesis of natural products 
is often prohibitive for the number of necessary steps and 
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the overall yield; furthermore, selective modification of a 
natural product can often be efficiently performed only on 
limited portions of the molecule. This difficulty of 
generating structural derivatives by conventional medicinal 

5 chemistry slows down the process of lead optimization and 
supply. Microorganisms employ intricate biosynthetic 
machineries to make natural products: for example, 
synthesis of the macrolide antibiotic erythromycin, a 
secondary metabolite in the medium-range structural 

10 complexity, requires the participation of over 40 different 
enzymatic activities (Katz and Donadio, 1995, Macrolides, 
in Genetics and Biochemistry of Antibiotic Production, 
Vining and Stuttard eds . , Butterworth-Heinemann, Boston CT, 
p. 385-420) . Biosynthetic pathways can often be redirected 

15 through manipulation of the fermentation conditions or of 
the biosynthesis genes, in order to produce desired analogs 
of the original structure. The availability of genes 
involved in the formation of secondary metabolites has been 
exploited for the formation of derivatives of natural 

20 products obtained after genetic manipulation of the 
producing organism (Hopwood, 1997, Chem. Rev. 99:0-39). 
These manipulations have resulted in novel molecules, many 
of which would be extremely hard if not impossible to 
produce by chemical derivatization of the parent compound. 

25 The obvious economical and environmental benefits resulting 
from the formation of the desired structure in one 
fermentation step constitute an additional stimulus for the 
application of pathway engineering for the rational design 
of novel structures. The compounds obtained in this way are 

30 amenable evaluation of their biological properties as well 
as being substrates for further derivatization by chemical 
or biological means. 

In summary, the supply of a natural product produced 
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by a newly discovered microorganism, the optimization of 
the complex composition, and the process of lead 
optimization will all benefit from a detailed knowledge of 
the genetics and physiology of the producing strain. The 
5 present invention describes a general method for 
transferring the capability to produce any secondary 
metabolite from the original actinomycete producer to an 
established and genetically manipulatable production host. 
The general concept of the invention is illustrated in Fig. 

10 1. Conditions for optimal growth, metabolite production and 
maintenance need therefore to be developed for one host. In 
addition, the availability of the cloned genes in a 
genetically manipulatable and well characterized host 
allows the utilization of all the genetic tools developed 

15 for these strains for the creation of novel derivatives of 
the natural product after genetic intervention. 

3. SUMMARY OF THE INVENTION 

The present invention provides a system for producing and 
20 manipulating natural products produced by a large group of 
bacteria for the purpose of drug discovery, development and 
production. The method of the invention transfers the 
ability to produce a secondary metabolite from an 
actinomycete that is the original producer of the natural 
25 product, to another production host that has desirable 
characteristics . 

In one embodiment, the invention involves the 
construction of a library from a donor organism, the 
producer of a natural product, in an Artificial Chromosome 
30 that can be shuttled between a convenient, neutral cloning 
host, such as the bacterium Escherichia coli, and a 
production host, such as the actinomycetes Streptomyces 
lividans or Streptomyces coelicolor . The clones directing 
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the synthesis of the natural product are identified in said 
library, transferred into the production host where said 
natural product is synthesized. 

In another embodiment, the invention involves the 

5 reconstruction of a large segment that directs the 
synthesis of a natural product, starting from smaller DNA 
fragments cloned from the genome of a donor organism. This 
reconstruction occurs in an Artificial Chromosome that can 
be maintained in a convenient neutral host, such as the 

3 bacterium Escherichia coli, and subsequently transferred 
into an actinomycete production host. The reconstructed 
genomic segment in the Artificial Chromosome is transferred 
into the production host where said natural product is 
synthesized. 

5 The present invention also relates to Escherichia 

coli-Streptomyces Artificial Chromosomes, recombinant DNA 
vectors useful for shuttling the genetic information 
necessary to synthesize a given natural product between a 
donor actinomycete producer and a production host. 

3 

3.1 DEFINITIONS 

As used herein, the following terms will have the meaning 
indicated. 

An "Escherichia coli-Streptomyces Artificial 

5 Chromosome", . or ESAC, is a recombinant DNA construct that 
can maintain very large DNA inserts in an Escherichia coli 
host and that can be introduced and maintained in an 
actinomycete production host. 

An "Escherichia coli-Streptomyces Artificial 
0 Chromosome" library, or ESAC library, is a library of 
different recombinant constructs carrying very large DNA 
inserts that can be maintained in an Escherichia coli host 
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and introduced and maintained in an actinomycete. production 
host . 

A pESAC is a vector used to construct an "Escherichia 
coli-Streptomyces Artificial Chromosome" or an ESAC library. 
5 A "natural product" is a secondary metabolite made by a 

microorganism through a series of biosynthetic steps. This 
natural product may or may not have any useful biological 
activity. 

A "complex" is the mixture of related natural products 
10 with similar properties and biological activity that are 
often produced by the same biosynthetic pathway. 

A "donor organism" is the original producer of a 
natural product, where the synthesis of said compound is 
governed by a defined number of genetic elements. 
15 A "gene cluster", a "cluster", a "biosynthesis cluster" 

all designate a contiguous segment of the donor organism's 
genome that contains all the genes required for the 
synthesis of a natural product. 

A "production host" is a microorganism where the 
20 formation of a natural product is directed by a gene 
cluster derived from a donor organism. 

As used in the present invention, the following 
abbreviations are employed: °C (Celsius degree) ; h (hour) ; 
min (minute); kb (kilobase) ; jil (microliter); ml 
25 (milliliter) ; mm (millimeter) ; mg (milligram) ; \ig 
(microgram); ng (nanogram); M (molar); Mb (megabase) ; UV 
(ultraviolet); kV (kilovolt) ; Q (Ohm); mFa (millif araday ) . 

In addition, the following abbreviations are used: Ab, 
antibiotic; Ap, ampicillin; attB, chromosomal attachment 
30 site; attP, phage or plasmid attachment site; bp, base 
pair; ca . , circa (i.e. "about"); Cm, chloramphenicol; E. , 
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Escherichia; ESAC, E. coli-Streptomyces Artificial 
Chromosome; GC, guanosine + cytosine; HTS, high throughput 
screening; Km, kanamycin; int , integrase encoding gene; 
LB, Luria Broth; LMP f low melting point; P., Planobispora; 

5 PCR, polymerase chain reaction; PFGE, Pulsed Field Gel 
Electrophoresis; R , resistance; rpm, rounds per minute; S., 
Streptomyces; s , sensitive; Sac, Saccharopolyspora ; sacB, 
gene conferring sensitivity to sucrose/ SDS, sodium dodecyl 
sulfate; Tc, tetacycline; TE r TrisHCl EDTA buffer; tet, 

10 tetacycline resistance gene; Th, thiostrepton; ts, 
temperature sensitive; tsr , thiostrepton resistance gene; 
U f units; vol, volume; wt, weight; YEME, yeast extract malt 
extract medium. 

15 4. BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1. Scheme of the invention. The general concept 
of the invention, whereby the gene cluster required for the 
synthesis of a natural product in a donor organism is 
established as an ESAC in an Escherichia coli host, and 

20 then transferred into a desired production host, where it 
integrates into the chromosome and directs production of 
the secondary metabolite. The hexagon represents the 
natural product, the twisted thin line the bacterial 
chromosomes, and the thick line the desired gene cluster. 

25 The pESAC episome is represented by a circle 

Figure 2. ■ E. coli- Streptomyces Artificial Chromosome 
vectors. Vectors pPAC-Sl and pPAC-S2 differ solely for the 
orientation of the int-tsr cassette. Relevant features of 
the vectors are illustrated. Kmr indicates resistance to 

30 kanamycin; sacB indicates sensitivity to siicrose. Suitable 
cloning sites are shown as: B, BainHI; S, Seal; X, Xbal. The 
replicating function of bacteriophage PI are indicated by 
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the thick bars. 

Figure 3. General scheme of the invention, top-down 
approach. High molecular weight DNA from the donor organism 
is cloned into a pESAC . The resulting library in E. coli is 
5 screened with the required probes, and the relevant ESACs 
are identified. These are introduced into the desired 
production host strain, where they integrate site- 
specifically into the host chromosome. Symbols and 
abbreviations are as in Fig. 1. 

10 Figure 4. General scheme of the invention, bottom-up 

approach. A cosmid library is prepared with DNA from the 
donor organism and screened v/ith the required probes. The 
overlapping inserts from the positive cosmids, which 
consitute the correct contig,are assembled into a pESAC via 

15 homologous recombination in E. coli. The reconstructed ESAC 
is introduced into the desired production host, where it 
integrates site-specifically into the host chromosome. 
Symbols and abbreviations are as in Fig. 1. 

Figure 5. Scheme, of assemblage. The figure illustrates 

20 a hypothetical genomic segment from a donor organism that 
is covered by the inserts from three overlapping clones. 
The relevant fragments A and D, which denote the ends of 
the segment, and B and C, which represent regions of 
overlap, are indicated wi~h their relative orientation 

25 (thick side on the fragment rectangle) . The bottom part 
illustrates the reconstructed ESAC. 

Figure 6. Constructs required for cluster assemblage. 
The plasmids indicated are generated by routine in vitro 
DNA manipulations. Fragments A, B, C and D are as in Fig. 

30 5. Fragment pairs are in this example separated by a 
marker, indicated as Ab p for antibiotic resistance. 
Selective markers present on the two compatible replicons 
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are, as an example: Cm R and Km*. 

Figure 7. Interplasmid insert exchange. Each of the Cm R 
derivatives , as of Fig. 6, is introduced in the same E. 
coli cell as the cognate clone of Fig. 5 (for example a 
5 cosmid that carries a Km R marker) . Formation and then 
resolution of the cointegrate leads to the transfer of the 
cosmid 7 s insert, indicated here by a looping line, in the 
Cm R replicon. 

Figure 8. Sequel of assembling steps. A series of 
10 interplasmid cointegration and resolution events is 
conducted. Only the growing ESAC is indicated. The starting 
construct (Fig. 6) is recombined with plasmid pAB2 (Fig. 
7), leading to the insertion of the segment flanked by 
fragments A and B. Next, the Ab R marker from pBCl (Fig. 6) 
15 is introduced between fragments B and C, and subsequently 
replaced by the insert from pBC2 (Fig. 7) . Finally, the Ab R 
marker from pCDl (Fig. 6) is introduced between fragments C 
and D, and subsequently replaced by the insert from pCD2 
(Fig. 7) . 

20 Figure 9. A gene cluster from Planoblspora rosea. The 

extension of a gene cluster from P. rosea ATCC 53733 is 
reported, together with the cosmids pRP16, pRP31 and pRP58. 
The fragments A, B, C and D used for assemblage are 
highlighted. Restriction sites are abbreviated as: M, Smal; 

25 P, Pstl; S, Sstl. 

Figure 10. Site-specific integration of an ESAC. PFGE 
analysis of S. lividans ZX7 transformed with ESAO70. Lanes 
1 and 2: S. coelicolor M145; lane 3: S. lividans ZX7 DNA; 
lane 4: ZX7 attB: :ESAC-7 0 DNA, colony 1; lane 5: ZX7 

30 attB: :ESAC-70 DNA, colony 2; lane 6: 50-kb ladder, size 
marker. All DNAs in lanes 1-5 are digested with Oral. 
Conditions for PFGE are: 200 Volts, 70 s switching for 7 15 
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h, 120 s switching for 11 h. 

Figure 11. Characterization of 5. llvidans 
transf ormants . Southern hybridization of S. lividans 
attB;;PAD6, grown with (lane 1) or without (lane 2) 
5 thiostrepton . P. rosea DNA is shown as control (lane 3) . 
Lane 4 contains 1-kb ladder. All DNAs are digested with 
BamRI and probed with labeled PAD6 . 

5. DETAILED DESCRIPTION OF THE INVENTION 

10 In its broadest sense, the present invention entails a 
general procedure for constructing a Streptomyces host 
producing any natural product after selective transfer of 
the relevant genes from the original actinornycete producer, 
the donor strain. This general procedure is outlined in 

15 Fig. 1. The present invention can be applied with only 
limited information on the structure of the natural product 
and very little knowledge of the original producer ! s 
genetics. The present invention . has a substantial impact on 
the process of drug discovery involving natural products or 

20 their structural derivatives. The transfer of the producing 
capability to a well characterized host can substantially 
improve several portions of the process of lead 
optimization and development: the titer of the natural 
product in the producing strain can be more effectively 

25 increased; the purification of the natural product can be 
carried out in a known background of possible interfering 
activities; the composition of the complex can be more 
effectively controlled; altered derivatives of the natural 
product can be more effectively produced through 

30 manipulation of the fermentation conditions or by pathway 
engineering. In order to better understand the value of the 
present invention, a brief description is reported below of 
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the current methods for optimizing the productivity of the 
producing strain, for purifying a natural product, for 
controlling the composition of a complex, and for producing 
derivatives of a natural product. 
5 The production of a natural product is controlled by 

several mechanisms, few of which have been established in 
detail. Generally, the level of production of a natural 
product depends on the composition of the growth medium; on 
the presence of appropriate precursors or on the absence of 

10 specific inhibitors; on the expression level and timing of 
genes controlling the biosynthetic pathway and competing 
routes; and on the level and specific activity of key 
enzymes in the pathway. Because of this complexity, the 
productivity of the original strain is usually increased by 

15. an empirical process, which may include, among other 
things, one or more of the following steps: strain 
purification, selection of phenotypic variants arising 
spontaneously or after mutagenic treatment of the strain, 
variation in the fermentation medium or in the fermentation 

20 parameters; genetic engineering of the producing strain. 
Fundamental knowledge about the physiology of the producing 
strain and the variables affecting titer must be achieved 
for an effective improvement of productivity. This 
knowledge is very scant in a newly identified producer 

25 strain. 

During the discovery and development phase, sufficient 
quantities of a natural product must be available for an 
evaluation of its properties and/or for the generation of 
analogs. Because of its uniqueness, a specific purification 
30 process must be developed for each natural product. 
However, it is highly desirable to have the natural product 
as free as possible of compounds that may interfere with 
. the biological activity of the molecule. Contaminating 
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impurities must be characterized analytically and 
biologically. In a poorly characterized producer, little 
information is available on the relevance of contaminating 
impurities. 

5 A natural product may be produced by a microorganism 

as a complex of a few or tens of molecules with minor 
structural differences, designated congeners. Although most 
of the congeners are usually biologically active, only one 
or a few may represent the desired product: for example, 

10 one congener may be substantially more active than the 
others; it may possess better physico-chemical properties; 
or it may be a better substrate for chemical modification. 
The composition of a complex can be somehow controlled by 
intervening on the fermentation parameters. However, the 

15 most effective way is usually the altered expression of 
selected genes by genetic engineering (e.g. Sezonov et al . , 
1997, Nature Biotechnol. 15:349-353). 

Chemical modification of natural products represents 
the most commonly used means of obtaining novel structures. 

20 This approach has been successfully employed, but it still 
suffers from practical limitations to the number and type 
of compounds obtainable. The structural complexity of many 
natural products makes their total synthesis often too 
lengthy and expensive to be of any practical use. This same 

25 structural complexity, with either the presence of several 
closely related functional croups or their absence, limits 
modification of a natural product to selected portions of 
the molecule. Methods of combinatorial synthesis need an 
initial scaffold as the starting building block, and this 

30 can be often generated only through a low yield degradation 
of the natural product. However, derivatives of natural 
products that would be very hard if not impossible to 
produce by chemical means have been obtained after genetic 
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alteration of the biosynthetic pathway. Examples include 
the introduction of additional genetic information (Epp et 
al., 1989, Gene 85:293-301), the targeted inactivation of 
selected genes or portion thereof (Donadio et al., 1993, 
5 Proc. Natl. Acad, Sci. USA 90:7119-7123), the "mixing and 
matching" of genes or portions thereof from different 
pathways (McDaniel et al . , 1994, Nature 375:549-554). 

All the above activities are important for the process 
of lead optimization and for the development of selected 

10 lead structures. They can all benefit, to different extent, 
from a detailed knowledge of the physiology of the 
producing strain, and from the possibility of genetically 
manipulating it. The process by which a given organism is 
genetically manipulated in order to alter the type, quality 

15 or quantity of a natural product is referred lo as pathway 
engineering. The ability to perform pathway engineering in 
a newly isolated microorganism producing a bioactive 
molecule with promising characteristics can therefore 
considerably expedite the optimization of a lead structure 

20 and the development process. Pathway engineering can be 
schematized as a sequel of three steps: a) isolation of the 
genes of interest; b) performing on selected gene(s) the 
manipulations required by the specific objective; and c) 
introduction of the modified gene(s) in suitable form in an 

25 appropriate host. 

Isolation of the genes of interest from most 
actinomycetes can be achieved quite easily. The genes for 
primary metabolism are usually well conserved, and they can 
be easily accessed in any microorganism by using suitable 

30 hybridization probes or • by the PCR. The genetic elements 
governing the biosynthesis of the major classes of 
secondary metabolites have been also described, and many 
genes can similarly be identified. Since natural product 
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biosynthesis is governed by clusters, one needs to identify 
just a few genes in order to have them all. However, 
synthesis of the vast majority of natural products requires 
a considerable extent of genetic information. For examples, 
5 biosynthesis of the natural products erythromycin (an 
antibiotic) , avermectin (an antihelmintic agent) and 
rapamycin (an immunosuppressant) requires 55, 90 and 95 kb, 
respectively, of genetic information (Katz and Donadio, 
1993, Annu. Rev. Microbiol. 47:875-912; MacNeil, 1995, 

10 Avermectins, in Genetics and Biochemistry of Antibiotic 
Production, Vining and Stuttard eds . , Butterworth- 
Heinemann, Boston CT, p. 421-442; Schwecke et al., 1995, 
Proc. Natl. Acad. Sci. USA 92:7839-7843). Other natural 
products may require even larger extent of genetic 

15 information. Therefore, in order to isolate an entire 
cluster in a single piece, cloning vectors capable of 
accepting and maintaining large DNA segments are necessary. 

The manipulation of the isolated genes, is generally 
best performed in a convenient cloning host, such as E. 

20 coli. Manipulations relevant to pathway engineering can 
include some or all of the following: site directed 
mutagenesis, gene inactivation, gene fusions, modification 
of regulatory sequences, etc. Techniques for the in vitro 
manipulation of DNA and for the propagation of the mutated 

25 alleles in E. coli are well developed and can' be applied to 
DNA from virtually any source (Sambrook et al. f 1989, In 
Molecular Cloning: A laboratory Manual, 2nd edn, Cold 
Spring Harbor, New York: Cold Spring Harbor Laboratory 
Press) . 

30 The final step in pathway engineering- requires the 

introduction of modified or heterologous; gene(s), in 
suitable form, in a strain where these genes can be 
appropriately expressed. This strain is often the strain 
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producing the natural product whose quantity, quality or 
type one wants to alter. The genes of interest must be 
carried on appropriate vectors: according to the particular 
objective of pathway engineering, one may need, among 
5 others, vectors that can be stably maintained as single or 
multicopy episomes; that can insert into the host 
chromosome at a fixed location; that allow replacement of 
an endogenous gene with an in vitro modified allele; that 
allow delerion of selected genes from the host chromosome. 

10 In addition, for each strain one must have means for 
introducing heterologous DNA and selecting for its 
presence- Therefore, in order to genetically manipulate a 
given producer, one must establish conditions for rendering 
the bacterial cell capable of receiving incoming DNA; for 

15 selecting the incoming DNA; and develop vectors and 
methodologies for the various types of manipulations 
exemplified above. Low- and high copy-number f integrative, 
non-replicating vectors must be developed with appropriate 
selection markers. Thus, for each producing strain, 

20 specific gene transfer tools and conditions must be 
developed, starting in most cases from extremely poor 
knowledge about the microorganism. In addition, techniques 
developed for one species do not necessarily apply to a new 
species from the same genus, and often not even to a new 

25 strain. It is then no wonder that, among the thousands of 
strains described as producers of interesting natural 
products, gene transfer systems have been developed only 
for a limited number of species, which serve either as 
model organisms for genetic and physiological studies, or 

30 produce a commercially important molecule. The present 
invention provides tools for the general manipulation of 
any secondary metabolite pathway, and overcomes the 
difficulties of developing ad hoc conditions for a new 
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producer . 

Naive hosts have been shown to produce the appropriate 
natural product or its intermediate (s) when the relevant 
DNA was introduced into them (Malpartida and Hopwood, 1984, 
5 Nature 309:462-464; Hong et al., 1997, J. Bacteriol. 
179:470-476; Kao et al., 1994, Science 265:509-512; McGowan 
et al., 1996, Mol . Microbiol. 22:415-426; Kealey et al . , 
1998; Proc. Natl. Acad. Sci. USA 95:505-509). However, the 
examples reported thus far represent special cases. Indeed, 

10 they include the introduction of relatively small DNA 
segments into a production host; or the transfer of gene 
clusters within members of the same bacterial genus; or 
they have required ■ the careful engineering of specific 
biosynthesis genes under the control of appropriate genetic 

15 elements that direct their expression. Furthermore, the 
Streptomyces vectors currently available have an upper 
limit of ca. 40 kb (Hopwood et al . , 1987, Methods Enzymol. 
153:116-167) 

Until now, it was not established that DNA fragments 
20 exceeding 100 kb, derived from the high GC genome of 
actinomycetes, could be cloned and stably maintained in an 
E. coli host. Nor was any report of the introduction of 
large DNA segments into a Streptomyces host. The unexpected 
finding described herein is that these cloning tasks can be 
25 achieved according to the principles and methodologies of 
the present invention. Furthermore, the genetic elements 
required for the synthesis of a natural product in the 
original producer are genetically stable in a heterologous 
host, where they can direct the synthesis of the desired 
30 molecule. It was also unexpected and unprecedented that 
this heterologous stability and expression can occur when 
the donor organism and the production host belong to 
different bacterial genera. 
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The present invention rests on the fact that the genes 
required for the formation of a natural product are found 
as gene clusters of a defined size; that these gene 
clusters can be conveniently isolated, manipulated and 
5 transferred among different actinomycete strains; that they 
are expressed in a heterologous host; and on the fact that 
all the primary metabolite precursors required for the 
formation of a particular natural product are either 
produced by selected enzymes encoded by cluster-specific 

10 genes, or are present and available in the heterologous 
host at the time of formation of the natural product. The 
present invention addresses also the crucial aspect of 
natural product formation in act inomycetes : i.e. synthesis 
of many natural products may require over 100 kb of genetic 

15 information. To be generally applicable, transferring all 
the genes necessary for the production of any natural 
product requires cloning vectors capable of accommodating 
fragments as large as 150 kb, and possibly more. An object 
of the present invention is therefore represented by 

20 vectors capable of accommodating such large fragments which 
are also capable of being stably maintained in a suitable 
microbial host, such as a Streptomyces host. 

Examples of these vectors are designated with the 
generic name pESAC. They are derived from bacterial 

25 artificial chromosomes (Shizuya et al., 1992, Proc. Natl. 
Acad. Sci. USA 89:8794-87 97; Ioannou et al., 1994, Nature 
Genet. 6:84-89)and can carry inserts up to 300 kb, or more. 

As a general example of the broad applicability of the 
principles and methodologies described in the present 

30 invention, the Examples reported below describe how a 
convenient Streptomyces host can be engineered to carry a 
large gene cluster in order to produce a desired natural 
product through the use of an appropriate ESAC. The 
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exemplary organism chosen as the donor organism is the 
actinomycete P. rosea, belonging to one of the lesser 
characterized genera of actinomycetes (Goodfellow, 1992 , In 
The Prokaryotes, 2nd edn., Balows, Trueper, Dworkin, Harder 
5 and Schleifer eds, Springer-Verlag, New York, NY, USA) - 
This organism produces the natural product GE227 0 (Selva et 
al., 1991, J. Antibiotics 44:693-701), an antibacterial 
agent. This particular case therefore describes the general 
applicability of the present invention, since very little 

10 information is available on the donor organism, on its 
genetics and physiology, and on the gene clusters present 
in its genome. Further examples described herein illustrate 
the application of the principles and methodologies of the 
present invention to other gene clusters described in the 

15 literature. 

The present invention, relating to a general method 
for transferring the capability to produce any natural 
product from the original actinomycete to an established 
and genetically manipulatable S.treptomyces host, can be 

20 schematized in a series of passages summarized as: 1) 
design of suitable vectors; 2) construction of a large- 
insert library in said vectors; 3) selection of the desired 
clones with appropriate probes; 4) insertion of the 
selected clones into a convenient Streptomyces host; and 5) 

25 growth of the recombinant strain under appropriate 
conditions to produce the natural product. 

Actinomycetes produce a large number of natural 
products with important applications. However, other 
important classes of microbial producers are known, and 

30 newer ones are likely to be discovered in the upcoming 
years, as more microbial sources are screened for potential 
new drugs. Important classes of microbial producers 
include, among others, filamentous fungi, bacilli, 
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mixobacteria, pseudomonas and cyanobacteria . The series of 
passages described above can therefore be applied to other 
important classes of microbial producers, provided that two 
requisites are met: the synthesis of the desired natural 
5 product is governed by a gene cluster; suitable production 
host(s) exist; and appropriate selective marker (s) and 
maintenance function (s) are introduced into the Artificial 
Chromosome. 

Furthermore, the series of passages summarized above 

10 and described in detail in the Examples, involve the use of 
a neutral cloning host. This host, as described in the 
present invention, is the bacterium Escherichia coli. In a 
preferred example of such a host, a high cloning efficiency 
can be obtained, and many of the analyses of the ESACs can 

15 be quickly performed. However, it is evident to one of 
ordinary skill in this art that any other host that allows 
high cloning efficiency can be used as neutral cloning 
host. Additionally, the use of such a host is not a 
conditio sine qua hon for the applicability of the present 

20 invention. In fact, when it is possible to establish 
directly a library in a production host, there is no need 
for an intermediate neutral cloning host. 

In summary, the present invention consists of a method 
for transferring the production of a natural product from 

25 an actinomycete donor organism that is the original 
producer of said natural product to a different 
actinomycete host, where this transfer is achieved by means 
of an ' E. coli-Streptomyces Artificial Chromosome that 
carries' a gene cluster governing the biosynthesis of said 

30 natural product derived from said donor organism. This 
method comprises the steps of : 

(a) isolating large fragments of chromosomal DNA of 
the actinomycete donor organism of a size which encompasses 
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the gene cluster that directs the biosynthesis of the 
natural product; 

(b) constructing a suitable vector capable of 
accomodating said large fragments of chromosomal DNA and of 

5 introducing and stably maintaining said large fragments of 
DNA into an E. coli host; 

(c) constructing an E. coli-Streptomyces Artificial 
Chromosome by inserting said large fragments of chromosomal 
DNA of step (a) into the above said vector of step (b) and 

10 selecting the E. coli-Streptomyces Artificial Chromosome 
comprising the entire gene cluster construct that directs 
the biosynthesis of the above said natural product; 

(d) transforming an actinomycete host different from 
the donor actinomycete host with the E. coli-Streptomyces 

15 Artificial Chromosome of step (c) that carries the gene 
cluster governing the biosynthesis of said natural product 
wherein the actinomycete host carries a region which is 
specific for the integration of the E . coli-Streptomyces 
Artificial Chromosome. 

20 

6. GENERAL METHODS 

Plasniids, Bacterial Strains and Growth Conditions 
Plasmids pUCBM20, pUCBM21 f pBR322 and pUC18 are obtained 
from Boheringer Mannheim; plasmid pIJ39 and <t>C31 DNA have 

25 been described (Hopwood et' al., 1985, Genetic Manipulation 
of Streptomyces : A Laboratory Manual, The John Innes 
Foundation, Norwich, UK) and are available from prof. David 
Hopwood, The John Innes . Centre, Norwich, UK; plasmid 
pCYPAC2 has been described (loannou et al., 1994, Nature 

30 Genetics 6:84-89) and is available from prof. Pieter de 
Jong, Roswell Park Cancer Institute, Buffalo, NY, USA; 
plasmid pMAK705 has been described (Hamilton, et al . , 1989, 
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J. Bacterid, 171:4617) and is available from prof. Sidney 
Kushner, University of Georgia, Athens, USA; cosmid Lorist6 
has been described (Gibson et al., 1987, Gene, 53:283-286) 
and is from prof. Stewart Cole, Pasteur Institute, Paris, 

5 France. E. coli strains are obtained from commercial 
sources: DH5a (Life Technologies), DH10B (Life 
Technologies), C600 {E. coli Genetic Stock Center), DH1 
(Life Technologies) and XLlblue (Stratagene) . S. coelicolor 
M145 and S. lividans ZX7 have been described (Hopwood et 

10 al., 1985, Genetic Manipulation of Streptomyces : A 
Laboratory Manual, The John Innes Foundation, Norwich, UK) 
and are available from prof. David Hopwood, The John Innes 
Institute, Norwich, UK. Planobispora rosea ATCC 53733, 
Streptomyces hygroscopicus ATCC 29253, Amycolatopsis 

15 mediterranei ATCC 13685 and Saccharopolyspora erythraea 
NRRL2338 are from the ATCC culture collection. All other 
materials are from commercial sources. Media for 
cultivation of E . coli (Sambrook et al., 1989, In Molecular 
Cloning: A laboratory Manual, 2nd edn, Cold Spring Harbor, 

20 New York: Cold Spring Harbor Laboratory Press) and 
Streptomyces (Hopwood et al . , 1985, Genetic Manipulation of 
Streptomyces : A Laboratory Manual, The John Innes 
Foundation, Norwich, UK) have been described. The JM medium 
for S. coelicolor has been described (Puglia et al., 1995, 

25 Mol. Microbiol. 17:737-746). 

DNA Manipulations DNA manipulations are performed 
following described procedures, using the appropriate E. 
coli strains as cloning hosts (Sambrook et al., 1989, In 
Molecular Cloning: A laboratory Manual, 2nd edn, Cold 

30 Spring Harbor, New York: Cold Spring Harbor Laboratory 
Press) . Genomic DNA from actinomycetes is prepared as 
described (Hopwood et al., 1985, Genetic Manipulation of 
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Streptomyces: A Laboratory Manual, The John Innes 
Foundation, Norwich, UK) . A cosmid library of P. rosea DNA 
is constructed in the cosmid vector Lorist6 following 
published procedures (Sambrook et al., 1989, In Molecular 
5 Cloning: A laboratory Manual, 2nd edn, Cold Spring Harbor, 
New York: Cold Spring Harbor Laboratory Press) . 
Amplification by the PCR are performed following published 
guidelines (Innis, Gelfand, Sninsky and White, eds . , 1990, 
PCR Protocols: A guide to Methods and Applications, 

10 Academic Press, San Diego, CA, USA) . 

Hybridizations Probes Pep6 and Pep8 are derived from 
conserved motifs in peptide synthetase gene sequences 
(Turgay and Marahiel, 1994, Pept. Res. 7:238-241). 
Oligonucleotide probe Pep6 consists of an equimolar mixture 

15 of 5 ' -GCSTACATCATCTACACSTCSGGSACSACS-GGSAAGCCSAAGGG- 

3 f (SEQID N°l) and 5 f - 

GGSTACATCATCTACACSAGCGGSACSACSGGSAAGCCSAAGGG-3 1 (SEQID N°2) . 
Oligonucleotide probe Pep8 consists of an equimolar mixture 
of 5 ' -AKGCTGTCSCCSCCSAGSNNGAAG- AAGT YGTCGTCGATSCC- 3 1 {SEQID 

20. N°3) and 5 1 -AKGGAGTCSCCSCCSAGSNNGAAGAAGTYGTCGTCGATSCC- 
3 1 (SEQID N°4). [S indicates G or C; K indicates G or T; Y, 
C or T; and A T , any base] . Hybridizations are performed with 
a hybridization stringency set at 2xSSC, 55 °C, and a final 
wash set at the same stringency. 

25 Preparation of high molecular weight DNA Procedures for 

the preparation of high molecular weight DNA from 
actinomycetes for PFGE have been described (Dyson, 1993, 
Trends Genet. 9:72; Kieser et al., 1992, J. Bacteriol. 
174:5496-5507). They are modified for constructing 

30 libraries as described in the Examples. 

7 . EXAMPLES 
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The present invention consists in a series of passages, 
involving the design of suitable vectors; the introduction 
of large DNA inserts in said vectors employing genomic DNA 
from the donor organism; the selection of clones carrying 
5 the cluster specifying the synthesis of the desired natural 
product; the introduction of selected clone (s) into the 
appropriate production host; and the growth of the 
recombinant strain under appropriate conditions for 
metabolite production. These passages are described in 

10 detail in the Examples reported herein. These Examples 
outline the steps necessary to accomplish each passage, for 
the overall purpose of the present invention: the 
production of a natural product in a different host. They 
serve to illustrate the principles and methodologies of the 

15 present invention, and are not meant to restrict its scope 
to the Examples specified herein. 

7 . 1 Cloning vectors 

Bacterial Artificial Chromosomes are circular plasmids that 
20 can be easily propagated in and prepared from E. coli cells 
by standard miniprep methods (Shizuya et al., 1992, Proc. 
Natl. Acad. Sci . USA 89:8794-8797; Ioannou et al., 1994, 
Nature Genet. 6:84-89). In order to adapt Bacterial 
Artificial Chromosomes to a Streptomyces host, they need to 
25 be endowed with a selectable marker and maintenance 
functions. Site-specific integration, mediated by the 
action of an integrase encoded by the int gene, allows the 
stable incorporation of episomal elements inzo the host 
genome, at a defined locus designated attB. The episomal 
30 element needs to carry the cognate attP site and it may 
lack replicative functions. In addition, int-mediated 
excision of the integrated element from the chromosome via 
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reversal of the integration event can be prevented through 
selection of the resistance marker carried by the 
integrated episome; or, if necessary, after site-specific 
integration has occurred, the int gene on the integrated 
5 episome can be inactivated. Site-specific integration 
therefore allows the introduction of foreign DNA in single 
copy at a defined genetic locus. Several systems capable of 
directing site-specific integration of incoming circular 
DNA into the chromosome of a Streptomyces host have been 

10 described. A convenient system that can be used in the 
present invention is for istance the int-attP system 
derived from the temperate bacteriophage OC31 (Kuhstoss and 
Rao, 1991, J. Mol. Biol. 222:897-908), which directs, 
during lysogen formation, integration of the 41-kb phage 

15 genome at the attB site, located in a stable segment of the 
S. coelicolor chromosome (Redenbach et al., 1996, Mol. 
Microbiol. 21:77-96). Several selectable markers have been 
described that can be used for Streptomyces (Hopwood et 
al., 1985, Genetic Manipulation of Streptomyces: A 

20 Laboratory Manual, The John Innes Foundation, Norwich, UK) . 
The tsr gene, conferring resistance to the antibiotic 
thiostrepton (Thompson et al., 1982, Gene 20:51-62), is 
used in the present invention. The pESAC vectors, pPAC-Sl 
and pPAC-S2, described in the present invention, are 

25 depicted in Fig. 2. Their relevant features are: ability to 
accommodate DNA inserts up to 300 kb; low copy number in E. 
coli for increased stability; ease of propagation in E. 
coli because of the pUC19 staffer segment; BamHI, Xbal or 
Seal cloning sites, with positive selection of inserts for 

30 resistance to sucrose; T7 and SP6 promoters flanking the 
cloning site; Km R or Th R for selection in E. coli or 
actinomycetes, respectively; site-specific integration at 
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the OC31 attB site into the Streptomyces genome. Vectors 
pPAC-Sl and pPAC-S2 are 22 kb in size and differ solely for 
the orientation of the int-tsr cassette. After release of 
the stuffer pUC19 segment, the vector size is reduced to 
5 19.7 kb. When cloning in the BamRI site, the vector can be 
released by digestion with Dral, resulting in vector 
fragments of 7.4, 4.2 and 0.6 kb . The additional 7.5 kb of 
. vector DNA will be associated with the insert. Dral rarely 
cuts in the high-GC genome of actinomycetes, so that the 
10 insert size can be easily calculated. 

Example 1 

Isolation of the int region from OC31 

Two pairs of PCR primers, 5 1 -TTTTTGGTACCTGACGTCCCGAAGG- 
15 CGTG-3 1 (SEQID N°5) and 5 ' -CAGCTTGTCCATGGCGGA-3 ' (SEQID 
N°6); and 5 1 -TCTGTCCGCCATGGACAAGC-3 ' (SEQID N°7) and 5'- 
TTTTTGGATCCGGCTAACTAACTAAACCGAGA-3 1 (SEQID N°8), are used 
to amplify the int-containing fragments of 1.3 and 0.9 kb, 
respectively. The template is <&C31 DNA. The amplified 
20 fragments are digested with Kpnl + Ncol and Ncol + BamHI, 
respectively, and recovered from an agarose gel. 

Example 2 

Construction of plasmid pINT 
25 The 1.3 and 0.9 kb fragment, prepared as described in 
Example 1, are ligated to pQCMB21, digested with Kpnl + 
BamHI. The resulting mixture contains the desired plasmid 
pINT. 

30 Example 3 

Construction of E. coll K12 DH5a/pINT 

Approximately 10 ng of plasmid pINT, prepared as described 



SUBSTITUTE SHEET (Rule 26) 



WO 99/67374 



PCT/EP99/04079 



27 

in Example 2, are used to transform E. coli DH5a and a few 
of the resulting Ap R colonies that appear on the LB-agar 
plates are analyzed for their plasmid content. One colony 
is found to carry pINT, as verified by the observation, 
5 upon agarose gel-electrophoresis, of fragments of 4.0 and 
0.9 kb after digestion of the plasmid with Ncol + BamEI . 

Example 4 

Construction of plasmids pUITl 
10 The 1.8 kb BamEI fragment containing the tsr gene is 
isolated from pIJ39 and ligated to pINT, prepared as 
described in Example 3 and previously digested with BajnHI. 
The resulting mixture contains the desired plasmids pUITl. 

15 Example 5 

Construction of E. coli K12 DH5a/pUITl 

Approximately 10 ng of plasmid pUITl, prepared as described 
in Example 4, are used to transform E. coli DH5a and a few 
of the resulting Ap R colonies that appear on the LB-agar 
20 plates are analyzed for their plasmid content. One colony 
is found to carry pUITl, as verified by the observation, 
upon agarose gel-electrophoresis, of fragments of 4.9 and 
1.8 kb after BamEI digestion of the plasmid. 

25 Example 6 

Construction of plasmid pUIT3 

The 3.7 kb Apal fragment, containing the int-tsr cassette, 
is isolated from plasmid pUITl, prepared as described in 
Example 5, and ligated to pUCBM21 digested with Apal. The 
30 resulting mixture contains the desired plasmid pUIT3. 

Example 7 
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Construction of E. coli K12 DH5 a /pUIT3 

Approximately 10 ng of plasmid pUIT3, prepared as described 
in Example 6, are used to transform E. coli DH5a and a few 
of the resulting Ap R colonies that appear on the LB-agar 
5 plates are analyzed for their plasmid content. One colony 
is found to carry pUIT3, as verified by the observation, 
upon agarose gel-electrophoresis, of fragments of of 4.2 
and 2.2 kb after BamHI digestion of the plasmid. 

10 Example 8 

Construction of plasmid pUIT4 

The BamHI site present in the int-tsr cassette of plasmid 
pUIT3 is eliminated as follows. Plasmid pUIT3, prepared as 
described in Example 7, is partially digested with BamHI, 
15 followed by filling-in of the resulting ends, and treated 
with DNA ligase. The resulting mixture contains the desired 
plasmid pUIT4 . 

Example 9 

20 Construction of E. coli K12 DH5 a /pUIT4 

Approximately 10 ng of plasmid pUIT4, prepared as described 
in Example 8, are used to transform E. coli DH5a and a few 
of the resulting Ap R colonies that appear on the LB-agar 
plates are analyzed for their plasmid content. One colony 

25 is found to carry pUIT4 f as verified by the observation, 
upon agarose gel-electrophoresis, of a 6.4 kb fragment 
after BamRI digestion of the. plasmid. 

Example 10 

30 Construction of plasmid pPAC-Sl and pPAC-S2 

The 3.7 kb Apal fragment from pUIT4, prepared as described 
in Example 9, is mixed with pCYPAC2, previously digested 
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with Nhel. After filling-in of the ends, DNA ligase is 
added. The resulting mixture contains the desired plasmids 
pPAC-Sl and pPAC-S2. 

5 Example 11 

Construction of E. coli K12 DHlOB/pPAC-Sl and DH10B/pPAC-S2 
Approximately 10 ng of plasmids pPAC-Sl and pPAC-S2, 
prepared as described in Example 10, are used to transform 
E. coli DH10B and a few of the resulting Km R colonies that 

10 appear on the LB-agar plates are analyzed for their plasmid 
content. One colony is found to carry pPAC-Sl, as verified 
by the observation, upon agarose gel-electrophoresis, of 
fragments of 8.1, 4.8, 4.6, 2.2, 2.2, 0.5 and 0.1 kb after 
EcoRI digestion of the plasmid. Another colony is found to 

15 carry pPAC-S2, as verified by the observation, upon agarose 
gel-electrophoresis, of fragments of 8.1, 7.8, 2.2, 2.2, 
1.5, 0.5 and 0.1 kb after ScoRI + BamRI digestion of the 
plasmid. 

20 Although the present invention is described in the 

Examples listed above in terms of preferred embodiments, 
they are not to be regarded as limiting the scope of the 
invention. The above Examples serve to illustrate the 
principles and methodologies for constructing Bacterial 

25 Artificial Chromosomes that can be introduced in a 
Streptomyces host. It will occur to those skilled in the 
art that selectable markers different from the tsr gene can 
be employed for selection in Streptomyces. Other useful 
markers are described in detail in laboratory manuals 

30 (Hopwood et al., 1985, Genetic Manipulation of 
Streptomyces: A Laboratory Manual, The John Innes 
Foundation, Norwich, UK) and include but are not limited 
to: genes conferring resistance to apramycin, kanamycin, 

SUBSTITUTE SHEET (Rule 26) 



WO 99/67374 



PCT/EP99/04079 



30 

erythromycin, hygromycin, viomycin. It will also occur to 
those skilled in the art that functions other than those 
specified by 3>C31 can be used for directing site-specific 
integration in the Streptojnyces chromosome. These functions 
5 are described in recent literature (Hopwood and Kieser, 
1991, Methods Enzymol. 204:430- 458) and include but are 
not limited to those derived from pSAM2, SLP1, IS117. 
Bacterial Artificial Chromosomes derived from the E. coli F 
plasmid have been described (Shizuya et al . , 1992, Proc. 

10 Natl. Acad. Sci. USA 89:8794-8797). It will occur to those 
skilled in the art that, using the principles, and 
methodologies described above, the int-tsr cassette from 
pUIT4, prepared as described in Example 9, could be 
inserted into a unique site of pBAC108L (Shizuya et al . , 

15 1992, Proc. Natl. Acad. Sci. USA 89:8794-8797) or of 
suitable derivatives of this vector, leading to the 
formation of a BAC-based series of pESAC. It will occur to 
those skilled in the art that other pESACs differing, for 
example, in their size, in the E. coli replicon they carry, 

20 in the selectable marker for E. coli, in the cloning sites, 
can also be used in the present invention. Other 
differences and variations in the technical aspects of the 
present invention could be employed. These include but are 
not limited to: different methods and sources for 

25 obtaining selectable markers and integrative functions; 
different cloning sites and methodologies; different 
orientation of the insert; different E. coli hosts for 
amplifying the recombinant constructs. All these variations 
fall within the scope of the present invention, 

30 

7 . 2 Construction of large Inserts in pESAC 

Two distinct methodologies for introducing large DNA 
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fragments into the vectors described in Section 7.1 fall 
within the scope of the present invention. The first 
methodology can be referred to as the top-down approach and 
is depicted in Fig. 3. It consists of directly cloning the 
5 desired gene cluster into a pESAC through the construction 
of a genomic library of DNA fragments of average size of 
100 kb, or more. The library is then screened with suitable 
probes (Section 7.3) in order to identify the desired 
cluster. The second methodology can be considered a bottom- 

10 up approach and is illustrated in Fig. 4. It consists of 
assembling the desired gene cluster from pre-existing 
smaller segments of cloned, overlapping DNA, through the 
iterative use of homologous recombination in E. coll. The 
desired overlapping clones encompass the desired gene 

15 cluster and are identified as described in Section 7.3. 
Both methodologies fall within the scope of this invention. 
Depending on factors such as previous knowledge about the 
biosynthesis cluster, the extent of characterization of the 
producing strain, the existence of other natural products 

20 of interest produced by the original microorganism, one 
methodology may be preferred over the other. However, the 
two methodologies are not mutually exclusive. 

7.2.1 Preparation of a large insert library 

25 In order to prepare a large-insert library, particular care 
must be taken in the preparation of genomic DNA from the 
actinomycete strain of choice. Although several procedures 
have been described for the isolation of genomic DNA, few 
are suitable for obtaining sufficient yields of high 

30 molecular weight DNA. The strain of choice is grown in a 
medium that allows dispersed growth to facilitate lysis of 
the cells. Examples of suitable growth media for different 
genera of actinomycetes can be found in the literature 
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(Balows, Trueper, Dworkin, Harder and Schleifer eds., 1992, 
The Prokaryotes, 2nd edn., Springer-Verlag, New York, NY, 
USA) . The growth time should allow formation of a 
sufficient quantity of biomass; however, long incubation 
5 times should be avoided, since mycelia are generally more 
resistant to lysis as they age. The mycelium is pelleted, 
washed and embedded in agarose for the subsequent lytic 
steps. Lysis of the cells is achieved by a combination of 
enzymatic (e.g., incubation with lysozyme and/or 

10 achromopeptidase) and mild physical treatments (e.g., SDS) . 
The concentrations of reagents and the incubation times 
need to be optimized for each strain. A good starting point 
is represented by the conditions described in Example 12. 
The quality of the DNA preparation is checked by PFGE under 

15 appropriate conditions. Once a suitable preparation is 
obtained, the DNA can be digested as described in Example 
13. The exact incubation time and the units of restriction 
endonuclease are adjusted to the particular DNA preparation 
for optimizing the size and yield of the bulk of digested 

20 DNA, which should exceed 150 kb. The partially digested DNA 
is size-fractionated on a PFGE gel, without exposure to 
ethidium bromide or UV light, in order to avoid damage to 
the DNA. The gel slice containing the desired DNA fraction 
is localized by staining- the marker-containing portion of 

25 the gel and cut. All subsequent manipulations are performed 
with great care (Birren and Lai, 1993, Pulsed Field Gel 
Electrophoresis: A Practical Guide, Academic Press, New 
York, NY) . The size-selected DNA is licated to an 
appropriately prepared pESAC (see Example 14) employing a 

30 high molar excess of vector to insert (ca. 10:1) in order 
to minimize the formation of chimeric clones (i.e. those 
constituted by the religation of two uncontiguous inserts) . 
Subsequent steps are performed using published procedures 
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for the cloning in Bacterial Artificial Chromosomes, as 
described in Examples 16 and 17. 

The genome size of actinomycetes is around 8 Mb. 
Consequently, a 10-genome equivalents library consisting of 
5 800 clones with an average insert size of 100 kb has >99.9% 
probability of containing the desired clone (Sambrook et 
al., 1989, In Molecular Cloning: A laboratory Manual, 2nd 
edn, Cold Spring Harbor, New York: Cold Spring Harbor 
Laboratory Press) . Therefore, the average clone in the 

10 library will have a 10-kb segment (8,000 kb divided by 800 
clones = 10 kb/clone) of unique DNA , i.e. DNA not found in 
any other clone. Consequently, a 90 kb cluster will have a 
high chance of being exactly contained within one or two 
100-kb clones in a 800-clone library. The number of clones 

15 to be screened and the average insert size to be looked for 
in the ESAC library depends on the expected size of the 
biosynthesis gene cluster. The larger the difference 
between the average insert size and the expected size of 
the gene cluster, the smaller the number of clones to 

20 screen in order to identify an entire gene cluster in a 
single clone. ESAC DNA is prepared from a representative 
number of clones obtained after electroporation of a 
ligation mixture and analyzed for determining the frequency 
of insert-carrying clones and their average size. If 

25 necessary, all insert containing clones can be analyzed by 
miniprep procedure (Birren and Lai, 1993, Pulsed Field Gel 
Electrophoresis : A Practical Guide, Academic Press, New 
York, NY, USA) and clones carrying inserts below a certain 
threshold can be discarded. Alternatively, the number of 

30 clones carrying insert of the appropriate size can be 
estimated after analysis of a representative number of 
ESACs. The quality of the library can be evaluated by 
probing with cloned genes from the strain (if available), 
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or from highly conserved "housekeeping" genes from a strain 
with a similar GC content, such as S. coelicolor. 

Example 12 

5 Preparation of high molecular weight chromosomal DNA 

S. coelicolor strain M145 is grown in YEME medium 
containing 0.5% (wt/vol) glycine for 40 h at 30°C on an 
orbital shaker (ca. 200 rprn) . The mycelium is pelleted by 
centrif ugation r washed with 10.3% sucrose and the 
10 chromosomal DNA is extracted from the mycelium embedded in 
0.75% LMP agarose by treatment with 1 nig /ml lysozyme and 
with 1 mg/nl proteinase K in 0.1% SDS for 40 h at 50°C. 

Example 13 

15 Preparation of partially digested chromosomal DMA 

S, coelicolor Ml 4 5 chromosomal DNA, prepared as described 
in Example 12 and embedded in LMP agarose plugs, is 
partially digested by limiting the magnesium concentration 
for 20 min with 4 U of Sau3AI. The resulting DNA fragments 

20 are resolved by PFGE and the size-selected genomic DNA 
fraction (larger than 100 kb) is recovered and released 
from the agarose gel by digestion with gelase. 

Example 14 

25 Preparation of pPAC-Sl for library construction 

The vector pPAC-Sl, prepared as described in Example 11, is 
cut with Seal and then treated with calf intestinal 
phosphatase. The recovered DNA is then digested with BamHI 
and treated with an excess of calf intestinal phosphatase. 

30 The short Scal-BamHI linker fragments are removed by spin 
dialysis. 
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Example 15 

Construction of the SSAC library 

Size selected genomic DNA, prepared as described in Example 
13, is ligated to pPAC-Sl, prepared as described in Example 
5 14, employing 300 Molecular Biology Units of T4 DNA ligase 
in a 50 [il final volume and using a ca . 10:1 molar ratio of 
vector to insert. The resulting ligation mixture contains 
the desired ESAC- library, consisting of fragments S. 
coellcolor DNA inserted into the pPAC-Sl vector. 

10 

Example 16 

Introduction of the library into E. coli K12 PHI 0B 

The ligation mixture, prepared as described in Example 15, 

is drop-dialyzed against 0.5 X TE for 2 h using 0.025 mm 

15 type VS membranes (Millipore) and a few ul are used to 
electroporate 40 u.1 of electrocompetent £. coli DH10B 
cells. The electroporation conditions are: 2.5 kV, 100 Q 
and 25 mFa employing the Biorad Gene Pulser II. The cells 
are plated on LB-agar plates containing 25 |xg/ml Km and 5% 

20 sucrose to select for recombinant cells harboring insert- 
carrying pPAC-Sl. Individual colonies are picked into 0.1 
ml of LB broth containing 25 u,g/ml Km in 9 6-well microtiter 
plates, where they are stored at -80 °C after overnight 
incubation and addition of glycerol to 20% (v/v- . 

25 

Example 17 

Preparation of recombinant ESACs 

Individual colonies, prepared as described in Example 16, 
are inoculated into 5 ml of LB broth containing 25 |ig/ml Km 
30 and grown overnight. ESAC ; DNA is isolated using the 
alkaline extraction procedure (Sambrook et al . f 1989, In 
Molecular Cloning: A laboratory Manual, 2nd edn, Cold 
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Spring Harbor, New York: Cold Spring Harbor Laboratory 
Press)) without the phenol extraction step. The DNA is 
analyzed, after digestion with Dral, by PFGE. Three bands 
of 7.4, 4.2 and 0.6 kb are common to all clones and 
5 represent vector DNA. 

The examples described above illustrate the principles 
and methodologies of constructing a large-insert library of 
S. coelicolor DNA in a pESAC . Although the present 

10 invention is described in the Examples listed above in 
terms of preferred embodiments, they are not to be regarded 
as limiting the scope of the invention. The above 
descriptions serve to illustrate the principles and 
methodologies for constructing a large-insert DNA library 

15 in a pESAC. It will occur to those skilled in the art that 
other Streptomyces strains can be used as a source of DNA 
for constructing the library. For example, an ESAC library 
of the rapamycin producer Streptomyces hygroscoplcus ATCC 
29253 can be constructed, employing the procedures reported 

20 for PFGE analysis (Ruan et al., 1997, Gene 203:1-9) and 
applying the principles and methodologies described in 
Examples 12 through 17. 

It will also' occur to those skilled in the art that 
strains from actinomycete genera other than Streptomyces 

25 can be used as a source of DNA for constructing an ESAC 
library. These strains can belong to any genus of the order 
Actlnomycetales, which include but are not limited to the 
genera reported in Table 1. As another example, an ESAC 
library of the erythromycin producer Saccharbpolyspora 

30 erythraea can be constructed, employing the procedures 
reported for PFGE analysis (Reeves et al;. , 1998, 
Microbiology 14 4:2151-2159) and applying the principles and 
methodologies described in Examples 12 through 17. Those 
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skilled in the art understand that bacterial taxonomy is a 
rapidly evolving field and new genera may be described 
while old genera may be reclassified. Therefore,, the list 
of bacteria genera related to actinomycetes is very likely 
5 to change. Nonetheless, the principles and methodologies of 
the present invention can be applied to any donor organism 
related to the actinomycetes. 

It will also occur to those skilled in the art that 
different actinomycete strains will require growth media 

10 different from those reported in Example 12. Furthermore, 
alternative media and conditions for growth can be employed 
for obtaining mycelia for DNA preparation; that alternative 
methods of lysis of mycelia can be utilized; that 
restriction endonucleases other than Sau3AI can be equally 

15 effective for constructing a library; that other methods 
for fragmenting DNA can be employed. In addition, it will 
occur to those skilled in the art that pESAC other than 
pPAC-Sl, which include but are not limited to the possible 
vectors described in Section 7.1, can be used for 

20 constructing a library. Alternative methods for ligating 
DNA f for introducing the library in E. coli cells, and 
hosts other than DH10B are well described in the literature 
and can be employed in the present invention. All the above 
variations in strains, reagents and methodologies that can 

25 be employed for preparing a large-insert library of 
actinomycete DNA into a pESAC fall within the scope of the 
present invention . 
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Table 1 

List of exemplary genera of Actinomycetales 



Acidothermus 


Cellulomonas 


Kineococcus 


Actinobispora 


Chainia 


Kineosporia 


Actinocorallia 


Clavibacter 


Kitasatoa 


Act inokineospora 


Coriobacterium 


Kitasatosporia 


Actinomadura 


Corynebacterium 


Kocuria 


Actinomyces 


Couchioplanes 


Kutzneria 


Actinoplanes 


Cryoba c teri um 


Kytococcus 


Act inopoly spora 


Curtobacterium 


Lentzea 


Act in opycn i di um 


Da c tyl ospora n gi um 


Luteococcus 


Actinosporangim 


Demetria 


Mi croba cterium 


Actinosynnema 


Dermabacter 


Mi crobi spora 


Aeromicrobium 


Dermacoccus 


Micrococcus 


Agrococcus 


Dermatophilus 


Microellobosporia 


Agromyces 


Dietzia 


Microlunatus 


Ampullar iella 


Elytrosporangium 


Micromonospora 


Amycolata 


Excellospora 


Microsphaera 




Exiauobacteri um 


Micro r a spora 


Arcanobacterium 


FranJcia 


Microthrix 


Arthrobacter 


Friedmann iella 


Mobiluncus 


Atopobium 


Gardnerella 


Mycoba c teri um 


Aureobacteri um 


Geodermatophilus 


Nesterenkonia 


Bifidobacterium 


Glycomyces 


Nocardia 


Blastococcus 


Gordona 


Nocardioides 


Bogoriella 


Herbidospora 


Nocardiopsis 


Bra chyba c t eri um 


In t ra spora ngium 


Oerskovia 


Bre viba c t eri um 


Janibacter 


Pelczaria 


Catellatospora 


Jonesia 


Ph enyl oba cterium 


Catenuloplanes 


Kibdelosporangi um 


Pilimelia 
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Plmelobacter 


Rhodococcus 


Streptomyces 


Planobispora 


Rothia 


Streptosporangium 


Planomonospora 


Rubrobacter 


Streptovertlcilllum 


Planopolyspora 


Saccharomonospora 


Terrabacter 


Piano tetaspora 


Saccharopoly spora 


Terracoccus 


Prauseria 


Saccharothrix 


Thermoactinomyces 


Prom icromon o spora 


Sanguibacter 


Thermocri spurn 


Propi on iba cterium 


Skermania 


Thermomonospora 


Prop ion i fera x 


Spirilliplanes 


Tropheryma 


Pseudonocardia 


Spirillospora 


Tsukamurella 


Rarobacter 


Sporichthya 


Turicella 


Rathayibacter 


Stomatococcus 




Renlbacteri um 


Streptoalloteichus 
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7.2.2 Assemblage by homologous recombination 

The bottom-up strategy of assembling large fragments from a 
set of pre-existing smaller segments of partially 
overlapping DNA cloned from the genome of the actinomycete 
5 donor organism, is described in this section. This 
methodology makes use of the same pESAC described in the 
present invention under Section 7.1. The desired cluster is 
assembled from existing partially overlapping clones by the 
iterative use of homologous recombination in E. coli. In 

10 the example of Fig. 5, three overlapping clones, designated 
1, 2 and 3, and derived from the genome of a donor 
organism, encompass the desired biosynthesis cluster. These 
clones include leftward fragment "A" unique ~o clone 1; 
fragment "B" common to clones 1 and 2; fragment W C" common to 

15 clones 2 and 3; and rightward fragment "D" unique to clone 
3. These fragments can range from a few hundred bp to a few 
kb, and are thus amenable to routine in vitro . DNA 
manipulations. The number of overlapping clones 
encompassing the cluster may vary. However, if n is the 

20 number of overlapping clones that cover the desired genomic 
segment, the number of fragments to consider will be equal 
to n + 1. In the example illustrated in Fig. 5, four 
fragments are required. The cluster of Fig. 5 is 
reconstructed into a pESAC through the use of successive 

25 rounds of homologous recombination in E. coli. Fragments A 
and B are cloned in a ts vector, as shown in Fig. 6, which 
carries a selectable marker, Cm R as exemplified in Fig. 6. 
The same is done with fragment pairs B-C and C-D (Fig. 6) . 
The relative orientation of each fragment pair in the ts 

30 vector must be the same as in the gene cluster. The 
fragments in each pair may be separated by a selectable 
marker, designated Ab R in Fig. 6, to monitor interplasmid 
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insert exchange. Therefore, three constructs in the ts 
vector, designated pABl, pAB2 and pAB3, are required. The 
A-B-C-D four-fragment cassette is cloned in a pESAC (Fig. 
6) . The relative orientation of the four fragments in the 
5 pESAC must be the same as in the gene cluster. Again, a 
selectable marker may separate any of two fragments to 
monitor interplasmid insert exchange. The original clone 
(for example, a cosmid, which carries a selectable marker, 
Km R as exemplified in Fig, 7) containing part of the cluster 

10 and the cognate ts construct (Fig. 7) are introduced into 
the same E. coli cell. The interplasmid cointegrate between 
the original clone and the ts construct is selected at the 
non-permissive temperature for the ts replicon. This occurs 
through a single, reciprocal homologous recombination 

15 mediated by either one of the two fragments in the A-B, B-C 
or C-D pairs. The cointegrate is then resolved at the 
permissive temperature, leading to insert exchange between 
the two replicons (Fig. 7) . The presence in the ts replicon 
of the genomic segment comprised between fragments A and B 

20 can be monitored by the appearance of Cm R Ab s colonies. This 
is done fcr clone 1 and pABl, resulting in pAB2; for clone 
2 and pBCl, resulting in pBC2; and for clone 3 and pCDl, 
resulting in pCD2 . Each insert from the original 
overlapping clones (Fig. 5) is thus transferred into the ts 

25 replicon, ais outlined in Fig. 7. Subsequently, the inserts 
from clone 1, now present in the ts plasmid pAB2, is 
introduced into the pESAC construct carrying the entire A- 
B-C-D cassette. This is done by selecting for the 
interplasmid cointegrate between pAB2 and the pESAC 

30 construct at the non-permissive temperature, and then 
resolving the cointegrate at the permissive temperature, 
selecting for Km R Ab s colonies. This leads to insert 
exchange between the two replicons (as shown in Fig. 8). 
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Next, a selectable marker is introduced in the growing ESAC 
between the next fragment pair, again through the use of 
two rounds of single, reciprocal homologous recombination 
mediated by plasmid p3Cl, leading to the appearance of Km R 
5 Ab R colonies. Subsequently, the interplasmid exchange with 
pBC2 leads to the introduction of the genomic segment 
comprised between fragments B and C. Finally, the use of 
pCDl first and subsequently of pCD2 leads to the 
reconstruction of the genomic segment into the pESAC. 

10 Therefore, through the use of alternating steps, the Ab R 
marker first and the genomic segment later are introduced 
between any fragment pair, as schematized in Fig. 8. This 
iterative procedure results in the reconstruction of the 
original chromosomal region in the pESAC. 

15 A series of examples described herein illustrate how a 

90-kb gene segment from the actinomycete P. rosea is 
assembled from three pre-existing cosmids via homologous 
recombination. The cosmids, designated pRP16, pRP31 and 
pRP58 f are identified in a cosmid library constructed in 

20 the vector Lorist6 by the use of selective hybridization 
probes. The relevant information about the cluster is 
reported in Fig. 9. The reconstruction of the cluster 
results in the formation of the intermediate derivatives 
pPADl, PAD2, PAD4 and PAD6, carrying inserts of 10, 39, 68 

25 and 89 kb, respectively. The examples reported herein serve 
to illustrate the principles and methodologies of the 
present invention and are not meant to restrict its scope. 

Example 18 

30 Isolation of cosmid clones pRPl.6, pRP31 and pRP58 

A cosmid library of P. rosea DNA prepared in the vector 
Lorist6 is screened with oligonucleotide probes Pep6 and 
Pep8, according to the conditions described under Section 
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6. Among the positive colonies identified, several cosmids 
were found to span the ca. 90 kb segment of the P. rosea 
chromosome reported in Fig. 9. Signature sequences close to 
the left and right ends of this segment are reported as 
5 SEQID N°9 and SEQID N°10, respectively. Three cosmids are 
chosen for further studies. Cosmids pRP16, pRP31 and pRP58 
exhibits, after digestion with BamRI and resolution by 
agarose gel-electrophoresis, fragments of 7.5, 7.2, 5.6, 
5.2, 2.7, 2.0, 1.9, 1.9, 1.8, 1.6, 1.4, 0.9 and 0.7 kb; of 
10 10.5, 6.2, 3.1, 2.8, 2.6, 2.5, 2.1, 1.9, 1.9, 1.5, 1.4, 
1.2, 1.0, 1.0, 0.9, 0.9, 0.7, 0.6, 0.5, 0.5, 0.1 and 0.1 
kb; and of 10.0, 7.6, 6.7, 6.2, 3.4, 3.0, 2.8, 2.1, 1.0, 
1.0, 0.9, 0.9, 0.7, 0.6, 0.5, 0.5, 0.3 and 0.1 ir- 
respectively. 

15 

Example 19 

Construction of plasmid pUAl 

The 0.9 kb Sjnal-SstI fragment, comprised between map 
coordinates 2.0-2.9 kb of Fig. 9, is obtained from cosmid 
20 pRP16, prepared as described in Example 18, and ligated to 
pUC18 previously digested with SstI and Smal. The resulting 
mixture contains the desired plasmid pUAl. 

Example 20 

25 Construction of E. coli K12 XLlblue/pUAl 

Approximately 10 ng of plasmid pUAl, prepared as described 
in Example 19, are used to transform E . coli XLlblue and a 
few of the resulting Ap R colonies that appear on the LB-agar 
plates are analyzed for their plasmid content. One colony 

30 is found to carry pUAl, as verified by the observation, 
upon agarose gel-electrophoresis, of fragments of; 2.7 and 
0.9 kb after digestion of the plasmid with BamHI + SstI. 
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Example 21 

Construction of plasmid pUA2 

The 0.9 kb BamHI-SstI fragment from pUAL, prepared as 
described in Example 20, is ligated to pUCBM20 previously 
5 digested with BamKI and Sstl. The resulting mixture 
contains the desired plasmid pUA2 . 

Example 22 

Construction of E. coli K12 XLlblue/pUA2 

10 Approximately 10 ng of plasmid pUA2 , prepared as described 
in Example 21, are used to transform E. coli XLlblue and a 
few of the resulting Ap R colonies that appear on the LB-agar 
plates are analyzed for their plasmid content. One colony 
is found to carry pUA2, as verified by the observation, 

15 upon agarose gel-electrophoresis , of fragments of 2.7 and 
0.9 kb after digestion of the plasmid with EcoRI + Sstl. 

. Example 23 
Construction of plasmid pUBl 
20 The 1.8 kb Sstl-BamHI fragment, comprised between map 
coordinates 33.4-35.2 of Fig. 9, is obtained from cosmid 
pRP16, prepared as described in Example 18, and ligated to 
pUC18 previously digested with Sstl + BamHI . The ligation 
mixture contains the desired plasmid pUBl. 

25 

Example 2 4 

Construction of E. coli K12 XLlblue/pUBl 

Approximately 10 ng of plasmid pUBl, prepared as described 
in Example 23, are used to transform E. coli XLlblue and a 
30 few of the resulting Ap R colonies that appear on the LB-agar 
plates are analyzed for their plasmid content. One colony 
is found to contain plasmid pUBl as verified by the 
observation, upon agarose gel electrophoresis, of fragments 
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2.7 and 1.8 kb after digestion with SstI + Xbal. 
Example 25 

Construction of plasmid pUCl 
5 The 6.2 kb BamHI fragment, comprised between map 
coordinates 54.2-60.4 kb of Fig. 9, is obtained from cosmid 
pRP58, prepared as described in Example 18, and ligated to 
pUC18 previously digested with BamHI. The ligation mixture 
contains the desired plasmid pUCl. 

10 

Example 2 6 

Construction of E . coli K12 XLlblue/pUCl 

Approximately 10 ng of plasmid pUCl, prepared as described 
in Example 25, are used to transform E. coli XLlblue and a 
15 few of the resulting Ap R colonies that appear on the LB-agar 
plates are analyzed for their plasmid content. One colony 
is found to carry pUCl, as verified by the observation, 
upon agarose gel-electrophoresis, of fragments of 4.9 and 
4.0 kb after digestion of the plasmid with Pstl- 

20 

Example 27 

Construction of plasmid pUDl 

Synthetic oligonucleotides 5 1 -GATCTAAGCTTGGGGG-3 ' (SEQID 
N°ll) and 5 ! -CCCCCAAGCTTA-3' (SEQID N°12) are annealed and 

25 ligated to the 1.5 kb Pstl-BamHI fragment, comprised 
between map coordinates 89.5-91.0 kb of Fig. 9 and obtained 
from cosmid pRP58, prepared as described in Example 18. The 
ligation mixture is digested with Hindlll and ligated to 
pUC18 previously digested with PstI + Hindlll. The 

30 resulting mixture contains the desired plasmid pUDl. 

Example 28 

Construction of E. coli K12 XLlblue/pUDl 
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Approximately 10 ng of plasmid pUDl, prepared as described 
in Example 27, are used to transform E. coli XLlblue and a 
few of the resulting Ap R colonies that appear on the LB-agar 
plates are analyzed for their plasmid content. One colony 
5 is found to contain plasmid pUDl as verified by the 
observation, upon agarose gel-electrophoresis, of fragments 
of 2.7 and 1.5 kb after digestion with PstI + Hindlll. 

Example 2 9 

10 Construction of plasmid pUABl 

The 0.9 kb £coRI-SstI fragment from plasmid pUA2, prepared 
as described in Example 22, and the 1.8 kb Ss tl-Bamhl 
fragment from pUBl, prepared as described in Example 24, 
are ligated to pUC18 previously digested with EcoRI + 

15 BamRI . The ligation mixture contains the desired plasmid 
pUABl. 

Example 30 

Construction of E. coli K12 XLlblue/pUABl 

20 Approximately 10 ng of plasmid pUABl, prepared as described 
in Example 29, are used to transform E. coli XLlblue and a 
few of the resulting Ap R colonies that appear on the LB-agar 
plates are analyzed for their plasmid content. One colony 
is found tc carry pUABl, as verified by the observation, 

25 upon agarose gel-electrophoresis, of two fragments of 2.7 
kb after digestion of the plasmid with EcoRI + Xbal . 

Exam ple 31 

Isolation of the tet fragment 
30 The 1.6 kb fragment containing the tet gene is isolated 
after PCR amplification of pBR322 DNA using 
oligonucleotides 5 ' -GAGCTCTCATGTTTGACAGCT-3 ' (SEQID N°13) 
and 5 1 -GAGCTCTGACTTCCGCGTTTCCAG-3 1 (SEQID N°14) as primers, 
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followed by digestion with SstI. 
Example 32 

Construction of plasmid pUAB2 
5 Plasmid pUABl, prepared as described in Example 30, is 
digested with SstI and ligated to the tet fragment prepared 
as described in Example 31, The ligation mixture contains 
the desired plasmid pUAB2 . 

10 Example 33 

Construction of E. coli K12 DH5a/pUAB2 

Approximately 10 ng of plasmid pUAB2 , prepared as described 
in Example 32, are used to transform E. coli DH5a and a few 
of the resulting Tc R Ap R colonies that appear on the LB-agar 
15 plates are analyzed for their plasmid content. One colony 
is found to carry pUAB2, as verified by the observation, 
upon agarose gel-electrophoresis, of fragments of 4.3 and 
2.7 kb after digestion of the plasmid with EcoRI + Xjbal . 

20 Example 34 

Construction of plasmid pUBCl 

The 1.8 kb Sstl-Xbal fragment obtained from plasmid pUBl, 
prepared as described in Example 24, and the 4.0 kb Xbal- 
Pstl fragment obtained from plasmid pUCl, prepared as 
25 described in Example 26, are ligated to pUC18 previously 
digested with SstI + Pstl. The ligation mixture contains 
the desired plasmid pUBCl. 

Example 35 

30 Construction of E. coli K12 XLlblue/pUBCl 

Approximately 10 ng of plasmid pUBCl, prepared as described 
in Example 34, are used to transform E. coli XLlblue and a 
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few of the resulting Ap R colonies that appear on the LB-agar 
plates are analyzed for their plasmid content. One colony 
is found to carry pUBCl, as verified by the observation, 
upon agarose gel-electrophoresis , of fragments of 5.8 and 
5 2.7 kb after digestion of the plasmid with EcoRI + ffindlll. 

Example 36 

Construction of plasmid pUBC2 

Plasmid pUBCl, prepared as described in Example 35 and 
10 previously digested with Xbal, and the tet fragment, 
prepared as described in Example 31 , are treated with T4 
DNA polymerase and T4 DNA ligase. The ligation mixture 
contains the desired plasmid pUBC2 . 

15 Example 37 

Construction of E. coli K12 DH5 a /pUBC2 

Approximately 10 ng of plasmid pUBC2, prepared as described 
in Example 36/ are used to transform E. coli DH5a and a few 
of the resulting Tc R Ap R colonies that appear on the LB-agar 
20 plates are analyzed for their plasmid content. One colony 
is found to carry pUBC2, as verified by the observation, 
upon agarose gel-electrophoresis, of fragments of 5.6 and 
4.5 kb after digestion of the plasmid with ffindlll. 

25 Example 38 

Construction of plasmid pUCDl 

The 4.0 kb Xbal-Ps tl fragment obtained from plasmid pUCl, 
prepared as described in Example 26, and the 1.5 kb Pstl- 
Hindlll fragment isolated from plasmid pUDl, prepared as 
30 described in Example 28, are ligated to pUCIS previously 
digested with Xjbal and tfindlll. The mixture contains the 
desired plasmid pUCDl. 
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Example 39 

Construction of E. coll K12 XLlblue/pUCDl 

Approximately 10 ng of plasmid pUCDl, prepared as described 
5 in Example 38, are used to transform E. coll XLlblue and a 
few of the resulting Ap R colonies that appear on the LB-agar 
plates are analyzed for their plasmid content. One colony 
is found to carry pUCDl, as verified by the observation, 
upon agarose gel-electrophoresis, of fragments of 5.5 and 
10 2.7 kb after digestion of the plasmid with Xbal + HindXII . 

Example 4 0 

Construction of plasmid pUCD2 

Plasmid pUCDl, prepared as described in Example 39 and 
15 previously digested with PstI, and the tet fragment 
prepared as described in Example 31, are treated with T4 
DNA polymerase and T4 DNA ligase. The ligation mixture 
contains the desired plasmid pUCD2. 

20 Example 41 

Construction of E. coll K12 DH5a/pUCD2 

Approximately 10 ng of plasmid pUCD2, prepared as described 
in Example 40, are used to -ransform E. coll DH5a and a few 
of the resulting Tc R Ap R colonies that appear on the LB-agar 
25 plates are analyzed for their plasmid content- One colony 
is found to carry pUCD2, as verified by the observation, 
upon agarose gel-electrophoresis, of fragments of 6.7 and 
3.1 kb after digestion of the plasmid with Hindlll. 

30 Example 4 2 

Construction of plasmid pUADl 

The 4.3 kb TcoRI-Xbal fragment obtained from plasmid pUAB2, 
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prepared as described in Example 33 , and the 5.5 Xbal- 
Hindlll fragment from plasmid pUCDl, prepared as described 
in Example 39, are ligated to pUC18, previously digested 
with EcoRI + ifindlll. The ligation mixture contains the 
5 desired plasmid pUADl. 

Example 43 

Construction of E. coli K12 DH5a/pUADl 

Approximately 10 ng of plasmid pUADl, prepared as described 
10 in Example 42, are used to transform E. coli DH5a and a few 
of the resulting Tc R Ap R colonies that appear on the LB-agar 
plates are analyzed for their plasmid content. One colony 
is found to carry pUADl, as verified by the observation, 
upon agarose gel-electrophoresis, of fragments of 8.9 and 
15 3.6 kb after digestion of the plasmid with Hindlll. 

Example 4 4 

Construction of plasmid pMABl 

The 4.3 kb EcoRI-XJbal fragment obtained from plasmid pUAB2, 
20 prepared as described in Example 33, is treated with T4 DNA 
Polymerase and ligated to pMAK705 previously digested with 
Hindi. The ligation mixture contains the desired plasmid 
pMABl . 

25 Example 45 

Construction of E. coli K12 C6Q0/pMABl 

Approximately 10 ng of plasmid pMABl, prepared as described 
in Example 44, arc used to transform E. coli C600 and a few 
of the resulting Cm R Tc R colonies that appear on the LB-agar 
30 plates are analyzed for their plasmid content. One ' colony 
is found to carry pMABl, as verified by the observation, 
upon agarose gel-electrophoresis, of fragments of 4.1, 3.4, 
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1.4 and 0.9 kb after digestion of the plasmid with Hindlll 
+ EcoRI . 

Example 4 6 
5 Construction of plasmid pMBCl 

The 7.1 kb fragment from plasmid pUBC2, prepared as 
described in Example 37 , is obtained after partial 
digestion with PstI, treated with T4 DNA polymerase and 
ligated to pMAK705 previously digested with Hindi. The 
10 ligation mixture contains the desired plasmid pMBCl. 

Example 4 7 

Construction of E. coli K12 C600/pMBCl 

Approximately 10 ng of plasmid pMBCl, prepared as described 
15 in Example 46, are used to transform E. coli C600 and a few 
of the resulting Cm R Tc R colonies that appear on the LB-agar 
plates are analyzed for their plasmid content . One colony 
is found to carry pMBCl, as verified by the observation, 
upon agarose gel-electrophoresis , of fragments of 9.5, 1.5, 
20 1.3 and 0.3 kb after digestion of the plasmid with BamRI . 

Example 4 8 

Construction of plasmid pMCDl 
The 7.1 kb fragment from plasmid pUCD2, prepared as 
25 described in Example 41, is obtained by complete digestion 
with Xbal and partial digestion with Hindlll, treated with 
T4 DNA polymerase and ligated to pMAK705, previously 
digested with Hindi. The ligation mixture contains the 
desired plasmid pMCDl . 

30 

Example 4 9 

Construction of E. coli K12 C60Q/pMCDl 

Approximately 10 ng of plasmid pMCDl, prepared as described 
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in Example 48, are used to transform E. coli C600 and a few 
of the resulting Cm R Tc R colonies that appear on the LB-agar 
plates are analyzed for their plasmid content. One colony 
is found to carry pMCDl, as verified by the observation, 
5 upon agarose gel-electrophoresis, of fragments of 8.6 and 
4.3 kb after digestion of the plasmid with BamRI . 

Example 50 

Construction of plasmid pPADl 

The 10.0 kb EcoRI-Ndel fragment from plasmid pUADl, 

prepared as described in Example 43, is ligated to pPAC-Sl, 
prepared as described in Example 11 and previously digested 

with Seal. The ligation mixture contains the desired 
plasmid pPADl. 

Example 51 

Construction of E. coli K12 C6QQ/pPADl 

Approximately 10 ng .of plasmid pPADl, prepared as described 
in Example 50, are used to transform E. coli C600 and a few 
of the resulting Km R Tc R colonies that appear on the LB-agar 
plates are analyzed for their plasmid content. One colony 
is found to carry pPADl, as verified by the observation, 
upon agarose gel-electrophoresis, of fragments of 19.7, 
5.8, 3.1 and 1.2 kb after digestion of the plasmid with 
BamRI . After digestion with Dral and resolution by PFGE, 
pPADl yields fragments of 17.4, 7.4, 4.2 and 0.6 kb. 

Example 52 

Construction of E. coli K12 C600/pMABl : ;pRP16 
30 E. coli C600/pMABl, prepared as described in Example 45, is 
transformed with ca . 50 ng of pRP16, prepared as described 
in Example 18. The Cm R Km R colonies that appear at 30 °C on 
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the LB-agar plates are grown at 30°C in LB broth containing 
Km and Cm, aliquots are withdrawn at various times and 
appropriate dilutions plated. Few of the Cm R Km R colonies 
that appear on the LB-agar plates after overnight 
5 incubation at 4 4°C are grown in LB broth containing Km and 
Cm for 16 h at 4 4°C and analyzed for their plasmid content. 
One colony is found to carry pMABl : :pRP16, as verified by 
the observation, upon agarose gel-electrophoresis, of 
fragments of 34, 10.7, 1.7, 1.6, 1.5, 1.2 and 0.6 leb after 
10 digestion of the plasmid with EcoRI . 

Example 53 

Construction of E. coli K12 C6Q0/pMAB2 

Several colonies of £. coli C600/pMABl : :pRP16, prepared as 
15 described in Example 52, are grown individually in LB broth 
containing Cm for 24 h at 30°C, diluted 1:100 and incubated 
for further 8 h. Appropriate dilutions are plated. Few of 
the resulting Cm R Km s Tc s colonies that appear at 30°C are 
analyzed for their plasmid content. One colony is found to 
20 carry pMAB2, as verified by the observation, upon agarose 
gel-electrophoresis, of fragments of 37 and 1.5 kb after 
digestion of the plasmid with £coRI . 

Example 54 

25 Construction of E . coli K12 DHl/pMBCl : : pRP31 

Approximately 50 ng of pRP31, prepared as described in 
Example 18, are used to transform E. coli DH1 cells. 
Competent cells are prepared from one of the resulting Km R 
colonies and transformed with ca. 10 ng of plasmid pMBCl, 

30 prepared as described in Example 47. The Cm R Km R colonies 
that appear at 30°C on the LB-agar plates are grown at 30°C 
in LB broth containing Km and Cm, aliquots are withdrawn at 
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various times and appropriate dilutions plated. Few of the 
Cm R Km R colonies that appear on the LB-agar plates after 
overnight incubation at 44°C are grown in LB broth 
containing Km and Cm for 16 h at 44°C and analyzed for their 
5 plasmid content. One colony is found to carry pMBCl : :pRP31, 
as . verified by the observation, upon agarose gel- 
electrophoresis, of fragments of 22.2, 14.1, 14.0 and 6.0 
kb after digestion of the plasmid with £coRV. 

10 Example 55 

Construction of E. coli K12 DHl/pMBC2 

Several colonies of E. coli DHl/pMBCl : :pRP31, prepared as 
described in Example 54, are grown individually in LB broth 
containing Cm for 24 h at 30°C, diluted 1:100 and incubated 

15 for further 8 h. Appropriate dilutions are plated. Few of 
the resulting Cm R Km s Tc s colonies that appear at 30°C are 
analyzed for their plasmid content. One colony is found to 
carry pMBC2, as verified by the observation, upon agarose 
gel-electrophoresis, of fragments of 14.4, 14.1 and 1.5 kb 

20 after digestion of the plasmid with EcoRI . 

Example 56 

Construction of E. coli K12 DHl/pMCDl : :pRP58 

Approximately 50 ng of pRP58, prepared as described in 

25 Example 18, are used to transform E. coli DH1 cells. 
Competent cells are prepared from one of the resulting Km R 
colonies and transformed with ca . 10 ng of plasmid pMCDl, 
prepared as' described in Example 48. The Cm R Km R colonies 
that appear at 30°C on the LB-agar plates are grown at 30°C 

30 in LB broth containing Km and Cm, aliquots are withdrawn at 
various times and appropriate dilutions plated. Few of the 
Cm R Km R colonies that appear on the LB-agar plates after 
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overnight incubation at 44°C are grown in LB broth 
containing Km and Cm for 16 h at 4 4°C and analyzed for their 
plasmid content. One colony is found to carry pMCDl : :pRP58 , 
as verified by the observation, upon agarose gel- 
5 electrophoresis, of fragments of 39, 16, 1.7, 1.6, 1.5, 1.2 
and 0.6 kb after digestion of the plasmid with £coRI . 

Example 57 

Construction of E.coli K12 DHl/pMCD2 

10 Several colonies of E. coli DHl/pMCDl : :pRP58 , prepared as 
described in Example 56, are grown individually in LB broth 
containing Cm for 24 h at 30°C, diluted 1:100 and incubated 
for further 8 h. Appropriate dilutions are plated. Few of 
the resulting Cm R Km s Tc s colonies that appear at 30°C are 

15 analyzed for their plasmid content. One colony is found to 
carry pMCD2, as verified by the observation, upon agarose 
gel-electrophoresis, of fragments of 42 and 1.5 kb after 
digestion of the plasmid with EcoRI . 

20 Example 58 

Construction of E. coli K12 C6QQ/pMAB2 : : pPADl 
E. coll C600/pMAB2, prepared as described in Example 53, is 
transformed with ca . 50 ng of plasmid pPADl, prepared as 
described in Example 51. The Cm R Km R colonies that appear at 

25 30°C on the LB-agar plates are grown at 30°C in LB broth 
containing Km and Cm, aliquots are withdrawn at various 
times and appropriate dilutions plated. Few of the Cm R Km R 
colonies that appear on the LB-agar plates after overnight 
incubation at 4 4°C are grown in LB broth containing Km and 

30 Cm for 16 h at 44°C and analyzed for their plasmid content. 
One colony is found to carry pMAB2 : :pPADl, as verified by 
the observation, upon agarose gel-electrophoresis, of 
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fragments of 19.7, 7.2, 5.6, 5.6, 5.5, 5.2, 3.1, 2.7, 1.9, 
1.9, 1.8, 1.8, 1.6, 1.4, 1.2, 0.9, 0.9 and 0.7 kb after 
digestion of the plasmid with BamHI . 

5 Example 59 

Construction of E.coli K12 C600/PAD2 

Several colonies of E. coli C600/pMAB2 : :pPADl r prepared as 
described in Example .58, are grown individually in LB 
containing Km for 24 h at 30°C, diluted 1:100 and incubated 

10 for further 8 h at 44°C. Appropriate dilutions are plated. 
Few of the resulting Km R Cm s Tc s colonies that appear at 37°C 
are analyzed for their plasmid content. One colony is found 
to carry PAD2, as verified by the observation, upon agarose 
gel-electrophoresis, of fragments of 19.7, 7.2, 5.6, 5.5, 

15 5.2, 2.7, 1.9, 1.9, 1.8, 1.8, 1.6, 1.4, 0.9 and 0.7 kb 
after digestion of the plasmid with BamRI . After Dral 
digestion and resolution by PFGE, PAD2 yields fragments of 
45, 7.4, 4.2 and 0.6 kb. 

20 Example 60 

Construction of plasmid pMCD3 
The 1.4 kb KpnI-XhoII fragment obtained from plasmid 
pCYPAC2 after digestion with XhoII, treatment with T4 DNA 
polymerase and digestion with Kpnl, and the 7.1 kb Xbal- 

25 Hindlll fragment from pUCD2, prepared as described in 
Example 4 0 and obtained after partial digestion with 
Hindi II, complete digestion with Xbal and treatment with T4 
DNA polymerase, are ligated to pMAK705, previously digested 
with Kpnl + Hindi. The ligation mixture contains the 

30 desired plasmid pMCD3. 

Example 61 
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Construction of E. coli K12 C6QQ/pMCD3 

Approximately 10 ng of plasmid pMCD3, prepared as described 
in Example 60, are used to transform E. coli C600 and a few 
of the resulting Cm R Tc R colonies that appear on the LB-agar 
5 plates are analyzed for their plasmid content. One colony 
is found to carry pMCD3, as verified by the observation, 
upon agarose gel-electrophoresis, of fragments of 9.8 and 
4.3 kb after digestion of the plasmid with BamRI . 

Example 62 

Construction of E. coli K12 C600/PAD2 : : pMCD3 

E. coli C600/PAD2, prepared as described in Example 59, is 
transformed with ca . 10 ng of plasmid pMCD3, prepared as 
described in Example 61. The Cm R Km R colonies that appear at 
30°C on the LB-agar plates are grown at 30°C in LB broth 
containing Km and Cm, aliquots are withdrawn at various 
times and appropriate dilutions plated. Few of the Cm R Km R 
colonies that appear on the LB-agar plates afzer overnight 
incubation at 4 4°C are grown in LB broth containing Km and 
Cm for 16 h at 44°C and analyzed for their plasmid content. 
One colony is found to carry PAD2::pMCD3, as verified by 
the observation, upon agarose gel-electrophoresis, of 
fragments of 19.7, 9.8, 7.2, 5.6, 5.5, 5.2, 4.3, 2.7, 1.9, 
1.9, 1.8, 1.8, 1.6, 1.4, 0.9 and 0.7 kb after digestion of 
the plasmid with BamRI . 

Example 63 

Construction of E . coli K12 C600/PAD3 

Several colonies of E. coli C600/PAD2 : : pMCD3 , prepared as 
30 described in Example 62, are grown individually in LB broth 
containing Km for 24 h at 30°C, diluted 1:100 and incubated 
for further 8 h at 44°C. Appropriate dilutions are plated. 
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Few of the resulting Km R Cm s Tc R colonies are analyzed for 
their plasmid content. One colony is found to carry PAD3, 
as verified by the observation, upon agarose gel- 
electrophoresis, of fragments of 21.5, 7.2, 5.6, 5.2, 4.3, 
5 2.7, 1.9, 1.9, 1.8, 1.8, 1.6, 1.4, 0.9 and 0.7 kb after 
digestion of the plasmid with BamHI . 

Example 64 

Construction of E. coli K12 C600/PAD3 : :pMCD2 
10 E. coli C600/PAD3, prepared as described in Example 63, is 
transformed with ca . 50 ng of plasmid pMCD2, prepared as 
described in Example 57. The Cm R Km R colonies that appear at 
30°C on the LB-agar plates are grown at 30°C in LB broth 
containing Km and Cm, aliquots are withdrawn at various 
15 times and appropriate dilutions plated. Few of the Cm R Km R 
colonies that appear on the LB-agar plates after overnight 
incubation at 44°C are grown in LB broth containing Km and 
Cm for 16 h at 4 4°C and analyzed for their plasmid content. 
One colony is found to carry PAD3::pMCD2, as verified by 
20 the observation, upon agarose gel-electrophoresis , of 
fragments of 21.5, 10, 9,0, 7.6, 7.2, 6.2 f 5.6, 5.2, 4.3, 
3.1, 2.8, 2.7, 1.9, 1.9, 1.8, 1.8, 1.6, 1.4, 1.0, 0.9, 0.9, 
0.9, 0.7, 0.5, 0.3 and 0.1 kb after digestion of the 
plasmid with BamHI . 

25 

Example 65 

Construction of E. coli K12 C60Q/PAD4 

Several colonies of £. coli C600/PAD3 : : pMCD2 , prepared as 
described in Example 64, are grown individually in LB broth 
30 containing Km for 24 h at 30°C, diluted 1:100 and incubated 
for further 8 h at 44°C. Appropriate dilutions are plated. 
Few of the resulting Km R Cm s Tc s colonies are analyzed for 
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their plasmid content. One colony is found to carry PAD4, 
as verified by the observation, upon agarose gel- 
electrophoresis, of fragments of 22, 10, 7.6, 7.2, 6.2, 
5.6, 5.2, 3.1, 2.8, 2.7, 1.9, 1.9, 1.8, 1.8, 1.6, 1.4, 1.0, 
5 0.9, 0.9, 0.9, 0.7, 0.5, 0.3 and 0.1 kb after digestion of 
the plasmid with BamHI . After Dral digestion and resolution 
by PFGE, PAD4 yields fragments of 79, 4.2 and 0.6 kb. 

Example 66 

Construction of E. coli K12 C600/PAD4 : :pMBCl 

E. coli C600/PAD4, prepared as described in Example 65, is 
transformed with ca. 10 ng of plasmid pMBCl, prepared as 
described in Example 47. The Cm R Km R colonies that appear at 
30°C on the LB-agar plates are grown at 30°C in LB broth 
containing Km and Cm, aliquots are withdrawn at various 
times and appropriate dilutions plated. Few of the Cm R Km R 
colonies that appear on the LB-agar plates after overnight 
incubation at 4 4°C are grown in LB broth containing Km and 
Cm for 16 h at 4 4°C and analyzed for their plasmid content. 
One colony is found to carry PAD4::pMBCl, as verified by 
the observation, upon agarose gel-electrophoresis, of 
fragments of 22, 10, 9.6, 7.6, 7.2, 6.2, 5.6, 5.2, 3.1, 
2.8, 2.7, 1.9, 1.9, 1.8, 1.8, 1.6, 1.5, 1.4, 1.3, 1.0, 0.9, 
0.9, 0.9, 0.7, 0.5, 0.3, 0.3 and 0.1 kb after digestion of 
the plasmid with BamHI. 

Example 67 

Construction of £. coli K12 C600/PAD5 

Several colonies of E. coli C600/PAD4 : : pMBCl , prepared as 
30 described in Example 66, are grown individually in LB broth 
containing Km for 24 h at 30°C, diluted 1:100 and incubated 
for further 8 h at 44°C. Appropriate dilutions are plated. 
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Few of the resulting Km R Cm s Tc R colonies are analyzed for 
their plasmid content. One colony is found to carry PADS, 
as verified by the observation, upon agarose gel- 
electrophoresis, of fragments of 22, 10, 7.6, 7.2, 6.2, 
5 5.6, 5.2, 3.1, 2.8, 2.7, 1.9, 1.9, 1.8, 1.8, 1.6, 1.4, 1.3, 
1.0, 0.9, 0.9, 0.9, 0.7, 0.5, 0.3, 0.3 and 0.1 kb after 
digestion of the plasmid with BamHl . 

Example 68 

10 Construction of E. coli K12 C60Q/PAD5: :pMBC2 

E. coli C600/PAD5, prepared as described in Example 67, is 
transformed with ca. 50 ng of plasmid pMBC2, prepared as 
described in Example 55. The Cm R Km R colonies that appear at 
30°C on the LB-agar plates are grown at 30°C in LB broth 

15 containing Km and Cm, aliquots are withdrawn at various 
times and appropriate dilutions plated. Few of the Cm R Km R 
colonies that appear on the LB-agar plates after overnight 
incubation at 4 4°C are grown in LB broth containing Km and 
Cm for 16 h at 44°C and analyzed for their plasmid content. 

20 One colony is found to carry PAD5::pMBC2, as verified by 
the observation, upon agarose gel-electrophoresis, of 
fragments of 65, 33, 5.6, 4.7, 3.4, 2.8, 2.1, 1.2, 1.2, 1.0 
and 0.4 kb after digestion of the plasmid with Hindlll. 

25 Example 69 

Construction of E. coli K12 C6Q0/PAD6 

Several colonies of E. coli C600/PAD5 : :pMBC2, prepared as 
described in Example 68, are grown individually in LB broth 
containing Km for 24 h at 30°C, diluted 1:100 and incubated 
30 for further 8 h at 44°C. Appropriate dilutions are plated. 
Few of the resulting Km R Cm s Tc s colonies are analyzed for 
their plasmid content. One colony is found to carry the 
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correct ESAC, designated PAD6, as verified by the 
observation, upon agarose gel-electrophoresis, of fragments 
of 47, 46, 8.1, 4.6, 2.2, 0.5 and 0.1 kb after digestion of 
the plasmid with £coRI. After digestion with Dral and 
5 resolution by PFGE, PAD6 yields fragments of 102, 4.2 and 
0.6 kb. 

Although the present invention is described in the 
Examples listed above in terms of preferred embodiments, 

10 they are not to be regarded as limiting the scope of the 
invention. The above Examples serve to illustrate the 
principles and methodologies for assembling pre-existing 
overlapping segments of DNA into pESAC. 

It will occur to those skilled in the art that the 

15 cluster of Fig. 9 can be assembled using A-B-C-D fragments 
other than those specified in the Examples. Any A fragment, 
such that no useful genes are present to its left (using 
the orientation of Fig. 9) can be used for assembling the 
cluster. Similarly, any D fragment, such that no useful 

20 genes are present to its right (using the orientation of 
Fig. 9) can also be used. Furthermore, any fragment common 
to pRP16 and pRP31, or to pRP31 and pRP58, can be used in 
place of the fragments B and C, respectively, described 
above. It will also occur to those skilled in the art that 

25 other methods for obtaining these fragments, such as use of 
different segments from the cluster of Fig. 9, of different 
restriction endonucleases, or of the PCR, can be used for 
achieving equivalent results. In addition, intermediate 
vectors, other than the pUO series used in the above 

.30 Examples, can be used for subcloning fragments A through D, 
and the use of these intermediate vectors is merely 
instrumental to the transfer of the fragment pairs into the 
ts vector. Some or all of the fragment pairs could 
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therefore be cloned directly into a ts vector. 

It will also occur to those skilled in the art that 
cosmids other than pRP16, pRP31 and pRP58 can be used to 
achieve equivalent results r provided that they encompass 
5 the entire gene cluster and they have overlapping segments. 
It will also occur to those skilled in the art that 
pMAK705, Lorist6 and pPAC-Sl, are merely examples of ts, 
cosmid and pESAC, respectively. Any of the several cosmid 
vectors described in the literature, other ts replicons 

10 derived from pMAK7 05 or other sources, and any of the pESAC 
other than pPAC-Sl, which include the possible vectors 
described in Section 7.1, can be used for obtaining 
equivalent results. 

Those skilled in the art understand that the purpose 

15 of a ts replicon is to select for interplasmid cointegrates 
at the non-permissive temperature. However, cointegrate 
formation can occur between any two replicons, and 
cointegrate can be isolated after transformation of 
suitable hosts with a plasrnid preparation made from an E. 

20 coli cell harboring both replicons. Selection for the 
antibiotic resistance markers carried by both replicons can 
lead to the isolation of cointegrates from the resulting 
transf ormants . 

Furthermore, it will occur to those skilled in the art 
25 that the inclusion of the tet marker between the A-B, B-C 
and C-D fragment pairs serves solely the scope of 
recognizing insert exchange after resolution of the 
interplasmid cointegrate . Selectable markers other than tet 
can be equally effective, as long as they are not present 
30 in the vectors; Those skilled in the art understand that 
the presence of a selectable marker within the fragment 
cassettes is not absolutely necessary, as insert exchange 
can be observed by other methods, such as selective 
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hybridization or PCR. Similarly, different £. coli hosts 
other than those used in the above Examples can be also 
employed . 

It will also occur to those skilled in the art that, 
5 as described in Examples 58 through 69, interplasmid insert 
exchange can be obtained in a sequel independent of the 
order of the overlapping cosmid clones in the genomic 
contig. Indeed, the schematic of Fig. 8 illustrates the 
sequel of interplasmid exchanges A-B, followed by B-C and 

10 then by C-D, while Examples 58 through 69 describe the 
sequel A-B, C-D and last B-C. Furthermore, technical 
variations on the methodologies employed here can produced 
equivalent results. All these variations fall within the 
scope of the present invention. 

15 It will occur to those skilled in the art that the 

principles and methodologies described in Sections 7.2.1 
and 7.2.2 are not mutually exclusive. For example, a 
construct equivalent to PAD6 can be directly isolated by 
subjecting the producer strain P. rosea to the principles 

20 and methodologies described in Section 7.2.1. Similarly, 
selected cosmids from the described S. coelicolor library 
(Redenbach et al . , 1996, Mol. Microbiol. 21:77-96) can be 
used for assembling a large chromosomal segment into pPAC- 
Sl, following the principles and methodologies described in 

25 Section 7.2.2. Furthermore, it will' occur to those skilled 
in the art that the principles and methodologies of Section 
7.2.1 and 7.2.2 can complement each other. For example, 
after constructing an ESAC library of P. rosea DNA, inserts 
from individual ESACs may be enlarged by applying the 

30 principles and methodologies of Section 7.2.2, using, for 
example, cosmids overlapping the cognate ESACs. 

Those skilled in the art understand that the 
principles and methodologies described in Section 7.2.2 and 
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illustrated in schematic form in Fig. 4 are general enough 
that they can be applied to other strains and clusters. 
Methods for preparing high molecular weight DNA, for 
constructing and propagating in E. coli an ESAC library can 
5 be developed from the principles and methodologies 
described in Examples 12 through 17. Methods for preparing 
the appropriate combinations of fragment pairs to yield the 
starting plasmids described in Fig. 6, can be developed for 
other clusters following the principles and methodologies 

10 described in Examples 19 through 51; methods for assembling 
an entire cluster into a pESAC can be developed following 
the principles and methodologies described in Examples 52 
through 69. In order to illustrate how the principles and 
methodologies described in Section 7.2 can be extended to 

15 other actinomycete strains producing different natural 
products, the constructions of ESACs carrying large gene 
clusters from different producer strains are reported 
herein. The Examples describe, for each cluster , ■ the 
selection of the appropriate fragments A, B, C and D; and 

20 the construction of the starting plasmids, equivalent to 
those reported in Fig. 6. These plasmids can then be used 
to to reassemble each cluster according to the scheme of 
Fig. 8. 

The rapamycin gene cluster from S. hygroscopicus is 
25 contained within three overlapping cosmids designated 
cos58, cos25 and cos2 (Schwecke et al . , 1995, Proc. Natl. 
Acad. Sci. USA 92:7839-7843). The Examples described herein 
report the preparation of the appropriate fragments A, B, C 
and D; the construction of the plasmids containing the A-B, 
30 B-C and C-D cassettes; and the cloning approach to obtain 
constrcuts equivalent to those reported in Fig. 6. 

Example 70 
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Preparation of the rapamycin fragments A, B, C and D 
Primers 5 ' -TTTTTGAATTCGGTACCAGCCGACGGCGA-3 1 (SEQID N°15) 
and 5 1 -TTTTTGGATCCCTGTTCCACCAGCGCACC-3 1 (SEQID 16) are used 
to amplify a 1.2 kb fragment from cos58; primers 5'- 
5 TTTTTTCTAGACCGTCGTCGGTGGTTCTC-3 ' ( SEQID N° 17) and 5 1 - 
TTTTTGGATCCAGGAAGCCCTGTGCTGTC-3 1 (SEQID N°18) a 1.2 kb 
fragment from cos58; primers 5 1 - 

TTTTTTGTAGAGGTCAAGATCCGGGGCAT-3 1 (SEQID N°19) and 5 f - 
TTTTTCTGCAGGACAGCGCCCTTGAGGTG- 3 ' (SEQID N°20) a 1.2 kb 

10 fragment from cos25; and primers 5 1 -TTT- 

TTCTGCAGGCGACGAAGAGGGGC-3 ! (SEQID N°21) and 5'- 

TTTTTAAGCTTAGCGCGACCGGGGCGGT-3 1 (SEQID N°22) a 0.9 kb 
fragment from cos2. Fragment A, B, C and D are then 
digested with EcoRI + BamRI, BamHI + Xbal, Xbal + PstI, and 

15 PstI + Hi/idIII f respectively. 

Example 71 

Construction of plasmid pURl 

Fragments A and B, prepared as described in Example 70, are 
20 ligated to pUC18 digested with EcoRI + Xbal. The resulting 
mixture contains the desired plasmid pURl . 

Example 7 2 

Construction of E. coli K12 DHl/pURl 

25 Approximately 10 ng of plasmid pURl, prepared as described 
in Example 71, are used to transform E. coli DHl and a few 
of the resulting Ap R colonies that appear on the LB-agar 
plates are analyzed for their plasmid content. One colony 
is found to carry pURl f as verified by the observation, 

30 upon agarose gel-electrophoresis, of fragments of 2.7 and 
2.4 kb after digestion of the plasmid with EcoRI + Xbal. 

Example 7 3 
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Construction of plasmid pUR2 

Fragments B and C, prepared as described in Example 70, are 
ligated to pUC18 digested with BamHI + Pstl. The resulting 
mixture contains the desired plasmid pUR2 . 

5 

Example 7 4 

Construction of E. coli K12 DHl/pUR2 

Approximately 10 ng of plasmid pUR2, prepared as described 
in Example 73, are used to transform E. coli DH1 and a few 
10 of the resulting Ap R colonies that appear on the LB-agar 
plates are analyzed for their plasmid content. One colony 
is found to carry pUR2, as verified by the observation, 
upon agarose gel-electrophoresis , of fragments of 2.7 and 
2.4 kb after digestion of the plasmid with BamRI + Pstl. 

15 

Example 75 

Construction of plasmid pUR3 

Fragments C and D, prepared as described in Example 70, are 
ligated to pUC18 digested with Xbal + Hindlll. The 
20 resulting mixture contains the desired plasmid pUR3 . 

Example 7 6 

Construction of E. coli K12 DHl/pUR3 

Approximately 10 ng of plasmid pUR3, prepared as described 
25 in Example 75, are used to transform E. coli DH1 and a few 
of the resulting Ap R colonies that appear on the LB-agar 
plates are analyzed for their plasmid content. One colony 
is found to carry pUR3, as verified by the observation, 
upon agarose gel-electrophoresis, of fragments of 2.7 and 
30 2.1 kb after digestion of the plasmid with EcoRI + Hindi II. 

Example 77 

Construction of plasmid pURll 
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Plasmid pURl , prepared as described in Example 72 and 
previously digested with BamHI, and the tet fragment, 
prepared as described in Example 31, are treated with T4 
DNA Polymerase and DNA ligase. The ligation mixture 
5 contains the desired plasmid pURll. 

Example 7 8 

Construction of £. coli K12 DHl/pURll 

Approximately 10 ng of plasmid pURll, prepared as described 
10 in Example 77 , are used to transform E. coli DH1 and a few 
of the resulting Tc R Ap R colonies that appear on the LB-agar 
plates are analyzed for their plasmid content. One colony 
is found to carry pURll, as verified by the observation, 
upon agarose gel-electrophoresis , of fragments of 3.9 and 
15 2.8 kb after digestion of the plasmid with Hindi II. 

Example 7 9 

Construction of the plasmid pUR21 

Plasmid pUR2, prepared as described in Example 7 4 and 
20 previously digested with Xbal, and the tet fragment, 
prepared as described in Example 31, are treated with T4 
DNA Polymerase and DNA ligase. The ligation mixture 
contains the desired plasmid pUR21. 

25 Example 80 

Construction of E. coli K12 DHl/pUR21 

Approximately 10 ng of plasmid pUR21, prepared as described 
in Example 79, are used to transform E. coli DH1 and a few 
of the resulting Tc R Ap R colonies that appear on the LB-agar 
30 plates are analyzed for their plasmid content. One colony 
is found to carry pUR21, as verified by the observation, 
upon agarose gel-electrophoresis, of fragments of 3.9 and 
2.8 kb after digestion of the plasmid with Hindi II. 
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Example 81 

Construction of the plasmid pUR31 

Plasmid pUR3, prepared as described in Example 7 6 and 
5 digested with PstI, and the tet fragment , prepared as 
described in Example 31, are treated with T4 DNA Polymerase 
and DNA ligase. The ligation mixture contains the desired 
plasmid pUR31. 

10 Example 82 

Construction of E. coli K12 DHl/pUR31 

Approximately 10 ng of plasmid pUR31, prepared as described 
in Example 81, are used to transform E. coli DH1 and a few 
of the resulting Tc R Ap R colonies that appear on the LB-agar 
15 plates are analyzed for their plasmid content. One colony 
is found to carry pUR31, as verified by the observation, 
upon agarose gel-electrophoresis, of fragments of 3.9 and 
2.5 kb after digestion of the plasmid with Hindlll. 

20 Example 83 

Construction of plasmid pUR13 

The 4.0 kb EcoRI-Xbal fragment obtained from plasmid pURll, 
prepared as described in Example 78 , and the 2.1 kb Xbal- 
Hindlll fragment obtained from plasmid pUR3, prepared as 
25 described in Example 76, are ligated to pUC18 digested with 
EcoRI + Hindlll. The ligation mixture contains the desired 
plasmid pUR13. 

Example 84 

30 Construction of E. coli K12 DHl/pUR13 

Approximately 10 ng of plasmid pUR13, prepared as described 
in Example 83, are used to transform E. coli DH1 and a few 
of the resulting Tc R Ap R colonies that appear on the LB-agar 
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plates are analyzed for their plasmid content. One colony 
is found to carry pUR13, as verified by the observation, 
upon agarose gel-electrophoresis , of fragments of 4.9 and 
3.9 kb after digestion of the plasmid with Hindlll. 

5 

Those of ordinary skill in the art understand that the 
plasmids constructed above can be used for transfering the 
two-fragment cassettes present in pURll, pUR21 and pUR31 
into a ts vector. This can be achieved by recovering the 

10 4.0 kb insert from pURll, the 4.0 kb insert from pUR21, and 
the 3.7 kb insert from pUR31, after digestion with EcoRI + 
Xbal, EcoRI + PstI, and EcoRI + Ndel, respectively. 
Similarly, those of ordinary skill in the art understand 
that the 6.1 kb four-fragment cassette present in plasmid 

15 pUR13 can be easily transfered into pPAOSl after digestion 
with EcoRI - Ndel. These subcloning experiments lead to the 
formation of plasmids equivalent to those reported in Fig. 
6. 

As another application of the principles and 
20 methodologies of the present invention, the Examples 
reported below describe the preparation of the appropriate 
fragments A, B, C and D trora the Sac. erythraea 
erythromycin gene cluster. This cluster has been described 
and is contained within a series of overlapping clones 
25 (Tuan et al., 19'90, Gene 90:21-29; Donadio et al., 1993, In 
Industrial Microorganisms: Basics and Applied Genetics, 
Baltz, Hegeman and Skatrud eds., ASM, Washington, DC, 
pp. 257-265; Pereda et al., 1997, Gene 193:65-71). The 
construction of the plasmids containing the A-B, B-C and C- 
30 D cassettes and; the cloning approach to obtain constructs 
equivalent to those reported in Fig. 6 are also described. 

Example 85 
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Preparation of the erythromycin fragment A 

Synthetic oligonucleotides 5 * -CATGGGAATTCGGGGG-3 ' ( SEQID 
N°23) and 5 ' -CCCCCGAATTCC-3 1 (SEQID N°24) are annealed and 
ligated to the 1.2 kb NcoI-BamHI fragment isolated from 
5 cosmid p3B2. The resulting mixture is digested with £coRI + 
BamHI . 

Example 8 6 

Preparation of the erythromycin fragments B, C and D 
10 Primers 5 1 -TTTTTGGATCCGGGGCAGCGGTTGGTTCC-3 1 (SEQID N°25) 
and 5 1 -TTTTTTCTAGAAGGCAGCTCCAGATGATC-3 1 (SEQID N°2 6) are 
used to amplify a 1.0 kb fragment from cosmid p3B2; primers 
5 1 -TTTTTCTAGACCGGACTCGGCCGGCTCG-3 ' (SEQID. N°27) and 5'- 
TTTTTCTGCAGCCGCACGCCTCGGTGGTC- 3 ' (SEQID N°23) a 1.1 kb 
15 fragment from cosmid pSl; and primers 5'- 
TTTTTCTGCAGGGACCCTGAGTGCAGGTG- 3 1 (SEQID N°29) and 5 1 - 
TTTTTAAGCTTCAGTAGCCGTCGCTGAGC-3 1 (SEQID N°30) a 1.1 kb 
fragment from plasmid pEB6. Fragments B, C and D are then 
digested with BamEI + Xbal, Xbal + Pstl, and PstI + 
20 Hindlll, respectively. 

Example 87 

Construction of plasmid pUEl 

Fragment A, prepared as described in Example 85, and 
25 fragment B, prepared as described in Example 86, are 
ligated to pUC18 digested with EcoRI + Xbal. The resulting 
mixture contains the desired plasmid pUEl. 

Example 88 

30 Construction of E. coli K12 DHl/pUEl ; 

Approximately 10 ng of plasmid pUEl, prepared as described 
in Example 87, are used to transform E. coli DH1 and a few 
of the resulting Ap R colonies that appear on the LB-agar 
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plates are analyzed for their plasmid content. One colony 
is found to carry pURl, as verified by the observation, 
upon agarose gel-electrophoresis , of fragments of 2.7 and 
2.2 kb after digestion of the plasmid with EcoRI + Xbal. 

5 

Example 8 9 

Construction of plasmid pt)E2 

Fragments B and C, prepared as described in Example 06, are 
ligated to pUC18 digested with BamHI + Pstl. The resulting 
10 mixture contains the desired plasmid pUE2 . 

Example 90 

Construction of E. coli K12 DHl/pUE2 

Approximately 10 ng of plasmid pUE2, prepared as described 
15 in Example 89, are used to transform E. coli DH1 and a few 
of the resulting Ap R colonies that appear on the LB-agar 
plates are analyzed for their plasmid content. One colony 
is found to carry pUE2, as verified by the observation, 
upon agarose gel-electrophoresis, of fragments of 2.7 and 
20 2.1 kb after digestion of the plasmid with BamEI + Pstl. 

Example 91 

Construction of plasmid pUE3 

Fragments C and D, prepared as described in Example 86, are 
25 ligated to pUC18 digested with Xjbal + Hindi II. The 
resulting mixture contains the desired plasmid pUE3. 

Example 92 

Construction of E. coli K12 DHl/pUE3 
30 Approximately 10 ng of plasmid pUE3, prepared as described 
in Example 91, are used to transform E. coli DH1 and a few 
of the resulting Ap R colonies that appear on the LB-agar 
plates are analyzed for their plasmid content. One colony 
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is found to carry pUE3, as verified by the observation, 
upon agarose gel-electrophoresis f of fragments of 2.7 and 
2.2 kb after digestion of the plasmid with EcoRI + ffindlll. 

5 Example 93 

Construction of plasmid pUEll 

Plasmid pUEl, prepared as described in Example 88 and 
previously digested with BamHI, and the Let fragment, 
prepared as described in Example 31, are treated with T4 
10 DNA Polymerase and DNA ligase. The ligation mixture 
contains the desired plasmid pUEll. 

Example 94 

Construction of E. coli K12 DHl/pUEll 

15 Approximately 10 ng of plasmid pUEll, prepared as described 
in Example 93, are used to transform E. coli DH1 and a few 
of the resulting Tc R Ap R colonies that appear on the LB-agar 
plates are analyzed for their plasmid content. One colony 
is found to carry pUEll, as verified by the observation, 

20 upon agarose gel-electrophoresis, of fragments of 3.9 and 
2.6 kb after digestion of the plasmid with ffindlll. 

Example 95 

Construction of the plasmid pUE21 
25 Plasmid pUE2, prepared as described in Example 90 and 
previously digested with Xbal r and the tet fragment, 
prepared as described in Example 31, are treated with T4 
DNA Polymerase and DNA ligase. The ligation mixture 
contains the desired plasmid pUE21. 

30 

Example 96 

Construction of E. coli K12 DHl/pUE21 

Approximately 10 ng of plasmid pUE21, prepared as described 
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in Example 95, are used to transform E. coli DH1 and a few 
of the resulting Tc R Ap R colonies that appear on the LB-agar 
plates are analyzed for their plasmid content. One colony 
is found to carry pUE21, as verified by the observation, 
5 upon agarose gel-electrophoresis, of fragments of 3.7 and 
2.7 kb after digestion of the plasmid with Hindlll. 

Example 97 

Construction of the plasmid pUE31 
10 Plasmid pUE3, prepared as described in Example 92 and 
digested with PstI, and the tet fragment, prepared as 
described in Example 31, are treated with T4 DNA Polymerase 
and DNA ligase. The ligation mixture contains the desired 
plasmid pUE31. 

15 

Example 98 

Construction of E. coli K12 DHl/pUE31 

Approximately 10 ng of plasmid pUE31, prepared as described 
in Example 97 , are used to transform E. coli DH1 and a few 
20 of the resulting Tc R Ap R colonies that appear cn the LB-agar 
plates are analyzed for their plasmid content. One colony 
is found to carry pUE31, as verified by the observation, 
upon agarose gel-electrophoresis, of fragments of 3.8 and 
2.7 kb after digestion of the plasmid with Hindlll. 

25 

Example 99 

Construction of plasmid pUE13 

The 3.8 kb EcoRI-Xbal fragment obtained from plasmid pUEll, 
prepared as described in Example 94, and the 2.2 kb Xbal- 
30 Hindlll fragment obtained from plasmid pUE3, prepared as 
described in Example 92, are ligated to pUC18 digested with 
£coRI + Hindlll. The ligation mixture contains the desired 
plasmid pUE13. 
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Example IOC 

Construction of E. coll K12 DHI/pUE13 

Approximately 10 ng of plasmid pUE13, prepared as described 
5 in Example 99, are used to transform E. coli DH1 and a few 
of the resulting Tc R Ap R colonies that appear on the LB-agar 
plates are analyzed for their plasmid content. One colony 
is found to carry pUE13, as verified by the observation, 
upon agarose gel-electrophoresis, of fragments of 4.8 and 
10 3.9 kb after digestion of the plasmid with Hindlll. 

Those of ordinary skill in the art understand 
that the plasmids constructed above can be used for 
transfering the two-fragment cassettes present in pUEll, 

15 pUE21 and pUE31 into a ts vector. This can be achieved by 
recovering the 3.8 kb insert from pUEll, the 3.7 kb insert 
from pUE21, and the 3.8 kb insert from pUE31, after 
digestion with £coRI + Xbal, EcoRI + Pstl , and EcoRI + 
Ndel, respectively. Similarly, those of ordinary skill in 

20 the art understand that the 6.0 kb four-fragment cassette 
present in plasmid pUE13 can be easily transfered into 
pPAC-Sl after digestion with EcoRI + Ndel. These subcloning 
experiments lead to the formation of plasmids equivalent to 
those reported in Fig. 6. 

25 The Examples reported above describe the principle and 

methodologies for assembling the erythromycin gene cluster 
into the pPAC-Sl. It will occur to those skilled in the art 
that the principles and methodologies illustrated in Fig. 7 
and described in Examples 52 through 69 can be applied to 

30 the erythromycin gene cluster, employing the pMAK7 05 
derivatives constructed according to the principles 
described above and the eryhtomycin cosmids. 

As a further example, the preparation of the 
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appropriate fragments A, B, C and D from the A. 
mediterranei rifamycin gene cluster is illustrated below. 
This cluster has been described and is contained within a 
series of overlapping clones (August et al . , 1998, Chem. 
5 Biol. 5:69-79) . 

Example 101 

Preparation of the rifamycin fragments A, B, C and D 
Primers 5 1 -TTTTTGAATTCTGCAGACCGCCGAGGAAG-3 1 -SEQID N°31) 
10 and 5 1 -TTTTTGGATCCGGAGTCGTAGCTGACGAC-3 1 (SEQID N°32) ; 5'- 



TTTTGGATCCCGACCACGCGGGGACGTC-3 1 


(SEQID 


N°33) 


and 


5'- 


TTTTTTCTAGACCAGGGAACCCGTGCTGC-3 1 


(SEQID 


N°34) ; 


5' 




TTTTTTCTAGACGGAAGCTCGCCGCGATC-3 ' 


(SEQID 


N°35) 


and 


5'- 


TTTTTCTGCAGGTCCGTAGCCCGGACACC-3 1 (SEQID 


N°36) ; 


and 


5 1 - 


TTTTTCTGCAGTTCGGGCGACAGTTCCT7-3 1 


(SEQID 


N°37) 


and 


5'- 


TTTTTAAGCTTCAACAAGCCATCCGGGTC-3 1 


(SEQID 


N°38 ) , are used 


to 


amplify fragments of 1.2, 


1.2, 1 


. 2 and 


1.0 


kb, 



• respectively, from A. mediterranei genomic DMA. Fragments 
A, B, C and D are then digested with £coRI + BainHI, BamHI + 
20 Xbal f Xbal + PstI, and PstI + Hindlll, respectively. 

Those of ordinary skills in the art understa nd that 
the fragments generated from the rifamycin gene cluster 
contain the same restriction sites as those generated from 

25 ' the rapamycin and erythromycin gene clusters, so that the 
same cloning strategies for generating the pUC18 
derivatives containing the A-B, B-C and C-D cassettes, 
described above in Examples 72-77 for the rapamycin cluster 
and 88-93 for the erythromycin cluster, can also be applied 

30 ; to the rifamycin cluster. In addition, the rifamycin 
: fragments A r B, C and D have been selected so that the same 
cloning methodologies described above for inserting tet 
within the A-B, B-C and C-D cassettes from the rapamycin 
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and erythromycin clusters, described in Examples 78-83 and 
94-99, respectively, can be applied in this instance as 
well . Furthermore, the construction of the four- fragment 
cassette can also make use of the same cloning strategy. 
5 Therefore, following the same principles and methodologies 
described in detail for the rapamycin and erythromycin 
clusters in Examples 72-85 and 88-101, respectively, 
plasmids equivalent to those reported in Fig. 6 can be 
constructed for assembling the rifamycin cluster into the 

10 pPAC-Sl. It will occur to those skilled in the art that the 
principles and methodologies illustrated in Fig. 7 and 
described in Examples 52-69 for the P. rosea cluster can be 
applied to any gene cluster, once the appropriate pMAK705 
derivatives have been constructed, employing available 

15 overlapping clones. 

Thus, as schematized in Fig. 7, interplasmid insert 
exchange can be conducted between any plasmid containing 
the desired region and the cognate ts construct. , Plasmids 
corresponding to pAB2, pBC2 and pCD2 can therefore be 

20 derived from any cluster. Similarly, the principles and 
methodologies illustrated in Fig. 8 can be applied 
employing the appropriate A-B-C-D cassette and the cognate 
pMAK705 derivatives prepared according to the scheme of 
Fig. 7. The principles and methodologies illustrated in 

25 Fig. 7 and Fig. ■ 8 and described in Examples 52-69 can 
therefore be extended to other clusters. 

It will occur to those skilled in the art that, 
although illustrated in Fig. 5 through 8 by three 
overlapping clones and described in the Examples 58 through 

30 69 by the use of five rounds of interplasmid insert 
exchange, the principles and methodologies described in 
this section of the present invention can be extended to a 
different number of overlapping clones. If n is the number 
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of overlapping clones that encompass the desired genomic 
segment, n will also be the number of homologous 
recombination rounds that introduce cluster DNA into the 
pESAC. If an Ab R marker is used to facilitate monitoring 
5 insert exchange , the total number of rounds of homologous 
recombination will be equal to 2n - 2. Interplasmid 
homologous recombination has been described to introduce 
large DNA segments into a desired vector (O'Connor et al., 
1989, Science 244:1307-1312; Kao et al., 1994, Science 
10 265:509-512) or to target a smaller segment into a large 
episome (Yang et a,, 1997, Nature Biotechnol. 15:859-865). 
However, it was not be anticipated that these procedures 
could be applied iteratively for the precise reconstruction 
of very large DNA segments. 

15 

7 . 3 Identification of positive clones 

The principles and methodologies described in Section 7.2 
for obtaining an entire gene cluster in a pESAC rely on the 
identification of the desired genomic segment. When using 

20 the principles and methodologies described in Section 
7.2.1, the desired clones are identified by screening an 
ESAC library with one of the possible strategies described 
below. When using the principles and methodologies 
described in Section 7.2.2, the desired clones are 

25 identified in a genomic library, such* as a cosmid library, 
with one of the possible strategies described below, and 
then assembled into pESAC. The principles and methodologies 
for identifying the genes responsible for the biosynthesis 
of natural products are well described in the literature 

30 and are reported here solely to illustrate the fact that 
they represent a necessary step in the overall scope of the 
present invention . 

The genes involved in the biosynthesis of natural 
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products in actinomycetes are invariably found as gene 
clusters in the chromosome of the producing organism, often 
associated with one or more resistance determinants. 
Consequently, identifying one gene allows ready access to 
5 all the others. One or more genes responsible for the 
biosynthesis of a natural product could have been 
described, or the entire cluster could be known. Several 
biosynthesis clusters from actinomycetes have been reported 
and other clusters are likely to be described in the 

10 future. Suitable probes from the cluster extremities can be 
derived from published clusters, when available. Thus, 
fragments A and D, described in Example 70, can be used as 
probes to screen an ESAC library prepared from S. 
hygroscopicus DNA. ESACs positive to both probes will 

15 contain the rapamycin cluster. Similar strategies can be 
applied to ESAC libraries prepared from Sac. erythraea and 
A. mediterranei DNA, screened with fragments A and D, 
prepared as described in Examples 85-86 and 101, 
respectively . 

20 If no biosynthesis genes are known, different 

strategies for identifying them can be applied. These 
strategies are well described in the literature and are 
summarized below. One possible strategy involves the 
isolation of the resistance gene(s) after cloning in a 

25 heterologous host that is sensitive to that natural product 
(for example, Stanzak et al., 1986, Bio/Technol. 4:229- 
232) . Another possible strategy is based on reverse 
genetics: a particular biosynthetic enzyme is purified, and 
from its partial protein sequence (s) the corresponding gene 

30 is isolated via PCR or hybridization (for example, Fishman 
et al., 1987, Proc. Natl. Acad. Sci. USA 84:8248-8252). 
Another approach relies on the complementation of mutants 
blocked in one or more biosynthesis steps, after 
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introduction of a DNA library constructed in a suitable 
vector into the wild type strain (for example, Malpartida 
and Hopwood, 1984, Nature 309:462-464). Another approach 
involves the construction of an expression library in a 
5 suitable vector in an appropriate host, where the gene 
product is sought after using specific antibodies or 
looking for a particular enzymatic activity (for example, 
Jones and Hopwood, 1984, J. Biol. Chem. 259:14151-14157). 
Another possible approach makes use of heterologous probes 

10 derived from biosynthesis, resistance or regulatory genes. 
Natural products can be broadly grouped into classes 
according to their biosynthetic origin, and for many of 
them suitable probes are available. For example, genes 
encoding aromatic or modular polyketide synthases can be 

15 effectively identified through the use of heterologous 
hybridization probes (Malpartida et al., 1987, Nature 
325:818-821; Schwecke et al., 1995, Proc. Natl. Acad. Sci . 
USA 92:7839-7843); suitable probes have been reported for 
peptide synthetase genes (Turgay and Marahiel, 1994, Pept. 

20 Res. 7 :238-241); for genes involved in the formation or 
attachment of modified sugars (Decker et al. f 1996, FEMS 
Microbiol. Lett. 141:195-201). As the understanding of Lhe 
genetics of natural product biosynthesis increases, other 
heterologous probes will become available. 

25 The size of clusters can be estimated from those of 

known clusters involved in the synthesis of structurally 
similar natural products. For examples, synthesis of 
macrolides is expected to require clusters in the 60-70 kb 
range (Katz and Donadio, 1993, Annu. Rev. Microbiol. 

30 47:875-912; Kuhstoss et al . , 1996, Gene 183:231-236); 
synthesis of glycopeptides, clusters in the 70 kb range 
(van Wageningen et al., 1998, Chem. Biol. 5:155-162). In 
instances where no clusters have been described for the 
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same structural class of natural products, the site of the 
relevant cluster can be estimated from considerations about 
its known or likely biosynthesis route. Once the desired 
cluster has been identified, its extent can be established 
5 by analysis of the DNA sequence of the cloned cluster or of 
parts thereof. Comparison of the DNA sequence to databases 
can allow the identification of the likely borders of the 
gene cluster. 

Employing the above mentioned approaches, the desired 
10 gene cluster can be identified in any library. If an ESAC 
library is used, the identified cluster is ready for 
transfer into the production host. If a smaller fragment 
library is employed, the cluster can be assembled into a 
pESAC. 

15 Those skilled in the art understand that, when an ESAC 

library from a donor organism is constructed, any ESAC can 
be selected from said library and transferred into a 
production host. Therefore, a single donor organism can be 
utilized as the source of several biosynthesis clusters 

20 that can be mobilized into a production host. Similarly, an 
ESAC library needs not be constructed from a single donor 
organism. 

7.4 Transformation of a Streptomyces host 

25 Once the desired gene cluster has been introduced into a 
pESAC, one or more ESACs are introduced into a suitable 
Streptomyces host. This is accomplished by employing 
published procedures for transformation of Streptorayces. 
Only minor modifications from established procedures 

30 (Hopwood et al., 1985, Genetic Manipulation of 
Streptomyces : A Laboratory Manual, The John Innes 
Foundation, Norwich, UK) are required for obtaining a 
sufficient number of transf ormants . Because transformations 
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are performed with single, purified ESACs, transformation 
efficiencies do not need to be particularly high. The 
Examples reported below illustrate ' the principles and 
methodologies for introducing ESACs into S. lividans. They 
5 serve to describe the present invention and are not meant 
to restrict its scope. Streptomyces transf ormants are 
selected for Th R , specified by the tsr marker present in the 
pESAC. Since the incoming DNA is incapable of self- 
replication in Streptomyces , site-specific integration 

10 occurs at the chromosomal attB site, mediated by the ±nt- 
attP function specified by the pESAC. That integration has 
occurred at the proper site can be verified by Southern 
hybridization or by PFGE analysis of the transf ormants . 
Fig. 10 illustrates a PFGE separation of a S. lividans 

15 derivative carrying an ESAC with a 70 kb insert integrated 
into its chromosome. 

Example 102 

Introduction of ESACs into S. lividans ZX7 
20 A few hundred ng of three individual ESACs, prepared as 

described in Example 17 and carrying inserts of S. 

coelicolor DNA of 70, 120 and 140 kb (designated ESAC-70, 

ESAC-120, and ESAC-140, respectively) , are used to 

transform protoplasts of S. lividans ZX7 . The colonies that 
25 appear on the R2YE plates, after overlaying with Th, are 

analyzed for their Th R by streaking them on fresh R2YE 

plates . 

Example 105 

30 Cultivation and preservation of S. lividans ZX7/ESAC 

Individual colonies of S. lividans ZX7 transf ormants with 
the individual ESACs, prepared as described in Example 102, 
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are grown for several passages in solid medium without and 
with Th. Spore suspension, or mycelium prepared after 
cultivation in JM or YEME medium with Th, are stored at - 
80°C after addition of glycerol to 20% (v/v) . 

5 

Example 104 

Characterization of S. lividans ZX7 attB: :ESAC-70 
Individual colonies of S. lividans ZX7 attB: :ESAO70, 
prepared as described in Example 102, are grown in YEME and 

10 total genomic DNA is prepared. The DNA is digested with 
BamRI, resolved by agarose gel-electrophoresis, and 
transferred onto a membrane. Hybridization to labeled pPAC- 
SI DNA, prepared as described in Example 11, reveals the 
appearance of three bands of approximately 16, 8 and 2.7 

15 kb. PFGE analysis of genomic DNA reveals the disappearance 
of a 2.5 Mb "Oral fragment present in ZX7 and the appearance 
of two fragments of 1.4 and 1.1 Mb (Fig. 10). 

Although the present invention is described in the 
20 Examples listed above in terms of preferred embodiments f 
they are not to be regarded as limiting the scope of the 
invention. The above Examples serve to illustrate the 
principles and methodologies for introducing ESACs into S. 
lividans, for cultivating the resulting transf ormants and 
25 for confirming their genotype. The above Examples serve to 
illustrate the principles and methodologies for the 
transformation of S. lividans with ESACs carrying DNA 
inserts from a different species. It will occur to those 
skilled in the art that additional ESACs, either containing 
30 ■ different inserts of S. coelicolor DNA, prepared as 
described in Example 17, or carrying DNA inserts from other 
actinomycetes can be used to transform S. lividans ZX7 . As 
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another example transfer of large DNA segments, the 
transformation of S. lividans with a P. rosea gene cluster 
is illustrated below. Confirmation of the correct genotype 
of the resulting transf ormants is illustrated in Fig. 11. 

5 

Example 105 

Construction S. lividans ZX7 attB: :PAD6 

A few hundred ng of PAD6, prepared as described in Example 
69, are used to transform protoplasts of S. lividans ZX7 . 
10 The colonies that appear on the R2YE plates, after 
overlaying with Th, are analyzed for their Th R by streaking 
them on fresh R2YE plates. 

Example 10 6 

Characterization of S . lividans 2X7 attB::PAD6 
Individual colonies of S. lividans ZX7 attB: :PAD6, prepared 
as described in Example 105, are grown in YEME medium and 
total DNA is prepared. The DNA is digested with BamHI, 
resolved by agarose gel-electrophoresis and transferred 
onto a membrane. Hybridization to labeled PAD 6 il 
illustrated in Fig. 11. Bands of 22, 10, 7.6, 7.2, 6.2, 
5.6, 5.2, 3.1, 3.0, 2.8, 2.7, 2.6, 2.5, 2.1, 1.9, 1.9, 1.8, 
1.6, 1.5, 1.4, 1.2, 1.0, 1.0, 0.9, 0.9, 0.9, 0.7, 0.6, 0.5, 
0.5, 0.3 and 0.1 kb . The profile of P. rosea DNA is shown 
for comparison. 

Those skilled in the art understand that S. lividans 
ZX7 attB: :PAD6 contains the expected number and size of 
bands expected from transfer of the cluster of Fig. 9 via 
30 PAD6. In analogy "to the above Examples, the rapamycin, 
erythromycin and rifamycin clusters assembled in pESAC, 
according to the principles and methodologies described in 
Section 7.2.2, can be used to transform S. lividans. It 
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will occur to those skilled in the art that other S. 
lividans strains can be equally used as hosts for 
transformation with ESACs. Furthermore , G>C31 can lysogenize 
other Streptomyces species, in addition to S. lividans . 
5 These include but are not limited to the species reported 
in Table 2. Furthermore, a <1>C31 attB site may be engineered 
into Streptomyces species or other actinomycetes that are 
not naturally lysogenized by phage G>C31. Therefore, any 
ESAC, prepared according to the principles and 
10 methodologies of Section 7.2, and any natural or engineered 
actinomycete host, fall within the scope of the present 
invention. 

It will occur to those skilled in the art that 
alternative methods for introducing DNA into an 

15 actinomycete host can be employed. These include but are 
not limited to electroporation {MacNeil, 198 9, FEMS 
Microbiol. Lett. 42:239-244) and conjugation from E. coli 
(Mazodier et al., 1989, J. Bacterid." 171:3583-3585). It 
will also occur to those skilled in the art that 

20 alternative media and growth conditions can be employed for 
cultivating the transf ormants , and that they can be 
analyzed by different methods than those described above. 
Technical variations on the methodologies described above 
can produced equivalent results. Ail these variations fall 

25 within the scope of the present invention. 



SUBSTITUTE SHEET (Rule 26) 



WO 99/67374 



PCT/EP99/04079 



85 



Table 2 

5 List of exemplary species of Streptomyces and other genera 
of Actinomycetales allowing a ttP-mediated integration of 
<t>C31 (Hopwood et al., 1985, Genetic Manipulation of 
Streptomyces: A Laboratory Manual, The John Innes 
Foundation, Norwich, UK; Lomovskaya et al., 1997, 
10 Microbiol. 143:875-883; Kuhstoss et al., 1991, Gene 97:143- 
146; Soldatova et al., 1994, Antibiot. Khimioter . 39 : 3-7 ) . 



Streptomyces coellcolor 
15 Streptomyces lividans 

Streptomyces hygroscopicus 
Streptomyces bambergiensis 
Streptomyces ambofaciens 
Streptomyces griseofuscus 
20 Streptomyces lipmanii 

Streptomyces thermotolerans 
Streptomyces clavuligerus 
Streptomyces fradiae 
Saccharopolyspora spinosa 
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7 . 5 Growth of the recombinant Streptomyces and metabolite 
production 

When an ESAC, introduced into a production host according 
to the principles and methodologies described in Section 
5 7.4, carries the entire biosynthesis gene cluster derived 
from a donor organism, the recombinant strain produces the 
relevant natural product. Naive actinomycete hosts have 
been shown to produce the appropriate natural product or 
its intermediate (s) when the relevant DNA was introduced 

10 into them (Malpartida and Hopwood, 1984, Nature 309:462- 
464; Hong et al . , 1997, J. Bacteriol. 179:470-476; Kao et 
al., 1994, Science 265:509-512). Thus, transf ormants of 
Streptomyces and other actinomycete species carrying the 
relevant biosynthesis clusters are expected to produce the 

15 corresponding natural product. The recombinant production 
hosts are cultivated in a suitable medium and the presence 
of the relevant metabolites is determined following 
appropriate procedures, which may include biological 
assays, chromatographic properties, MS, NMR, etc. 

20 It will occur to those skilled in the art that ESACs, 

containing the relevant biosynthesis cluster derived from 
any donor actynomycete, can be used to transform S. 
lividans. The resulting transf ormants will produce the 
corresponding natural product. For example, an ESAC 

25 carrying the rapamycin, erythromycin or rifamycin cluster, 
prepared according to the principles of Section 7.2, can be 
used to transform S. llvidans and rapamycin, erythromycin 
or rifamycin, respectively, can be produced by the 
resulting recombinant strain. Furthermore, it will occur to 

30 those skilled in the art that other Streptomyces or 
actinomycete strains that naturally contain or have been 
engineered to contain a phage OC31 attB site, can be used 
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as production hosts for desired natural products. 
Therefore, any natural product produced after introduction 
of the relevant cluster carried on ESAC into a suitable 
production host, falls within the scope of the present 
5 invention. 

The present invention describes principles and 
methodologies for optimizing and speeding up the process of 
lead optimization and development in drug discovery. These 
can be applied since the early stages of drug discovery as 

10 briefly summarized herein. A natural product produced by a 
donor organism has an interesting property, such as 
antibacterial, antifungal r antitumor, antihelmintic, 
herbicidal, immunosuppressive, or other pharmacological 
activity. The potential is seen for increasing the 

15 productivity of the producing organism, and/or for 
improving the biological or physico-chemical properties of 
said natural product after manipulating its structure. The 
biosynthetic pathway for the natural product is inferred 
from its chemical structure. This leads to a hypothesis on 

20 the genes involved, including the approximate size of the 
corresponding cluster. A large insert library is 
constructed in the pESAC vectors described herein using 
genomic DNA prepared from the donor organism. Through a 
judicious choice of hybridization probes and PCR primers, 

25 the desired cluster is identified in the library. 
Alternatively, the cluster is assembled into the pESAC 
vectors described herein from overlapping cosmid clones 
identified by hybridization as above. The selected clone (s) 
arc transferred into S. lividans, S. coelicolor or other 

30 suitable strain, and the resulting transf ormants are 
evaluated for production of the natural product. 

Once production is obtained, the desired genetic, 
physiological and technological manipulations can be 
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performed on the production host/ employing well-developed 
methodologies. The bioactive molecule is purified from a 
known host, amid a background of known metabolites. If 
necessary, ad hoc mutations can be conveniently introduced 
5 in the production host to eliminate unwanted, interfering 
products. Because of the deeper knowledge on the 
physiological processes and regulatory networks for 
secondary metabolism in the production host compared to the 
donor organism, targeted approaches to strain improvement, 

10 using classical and molecular techniques, can be applied. 
Furthermore, well-characterized mutant strains are 
available for the production host, and the desired ESAC 
could be easily introduced into different genetic 
backgrounds. In addition, the biosynthetic pathway can be 

15 easily manipulated, because of the availability of the 
cloned genes and the efficient genetic tools for the 
production host. Finally, additional specialized genes or 
even entire clusters can be introduced into the production 
host, further expanding the possible applications of the 

20 present invention. 

As it is apparent from the above description, a 
further object of this invention is to provide a process 
for the procduction of a natural product by cultivating an 
actinomycete strain capablre of producing said natural 

25 product in the presence of a nutrient medium, isolating and 
purifying said natural product, characterized in that the 
actinomycete strain capable of producing said natural 
product ( production host) is an actinomycete strain 
modified by means of an E. coli-Streptomyces Artificial 

30 Chromosome that carries a gene cluster governing the 
biosynthesis of said natural product derived from an 
actinomycete donor organism which is the original producer 
of said natural product, according to the method described 
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herein. Preferably, said modified actinomycete strain shall 
be a Streptomyces lividans or Streptomyces coelicolor 
strain. 

Finally, even in a case where the natural product may 
not be made by the production host after transfer of the 
relevant cluster, appropriate tools are available to remedy 
that situation. Lack of production of the expected natural 
product might be due to several possibilities: absence of 
required gene(s); DNA, gene product or natural product 
instability; inadequate levels of gene expression or of 
appropriate precursors. In a well-defined production host, 
each of these possible causes may be directly investigated 
and remedied. 

Therefore , the present invention provides significant 
advantages over the existing process of drug discovery and 
development, including production. It exploits the fact 
that the host where the natural product will be produced is 
an organism commonly used for process development and 
genetic manipulation, with substantial information 
available, including safety for handling it. 
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CLAIMS 

We claim: 

1) A method for transferring the production of a 
natural product from an actinomycete donor organism that is 

5 the original producer of said natural product to a 
different actinomycete host, where this transfer is 
achieved by means of an £. coli-Streptomyces Artificial 
Chromosome that carries a gene cluster governing the 
biosynthesis of said natural product derived from said 
10 donor organism. 

2) A method as in claim 1 comprising the steps of: 

(a) isolating large fragments of chromosomal DNA of 
the actinomycete donor organism of a size which 
encompasses the gene cluster that directs the 

15 biosynthesis of the natural product; 

(b) constructing a suitable vector capable of 
accomodating said large fragments of chromosomal 
DNA and of introducing and stably maintaining said 
large fragments of DNA into an E. coli host; 

20 (c) constructing an E. coli-Streptomyces Artificial 

Chromosome by inserting said large fragments of 
chromosomal DNA of step (a) into the above said 
vector of step (b) and selecting the E. coli- 
Streptomyces Artificial Chromosome comprising the 

25 entire gene cluster construct that directs the 

biosynthesis of the above said natural product; 
(d) transforming an actinomycete host different from 
the donor actinomycete host with the E. coli- 
Streptomyces Artificial Chromosome of step (c) 

30 that carries the gene cluster governing the 

biosynthesis of said natural product wherein the 
actinomycete host carries a region which is 
specific for the integration of the E. coli- 

SUBSTTTUTE SHEET (Rule 26) 



WO 99/67374 PCT/EP99/04079 

91 

Streptomyces Artificial Chromosome. 

3) A process as in claim 2 wherein the large fragments 
of genomic DNA of the actinomycete donor organism of step 
(a) are obtained by partial digestion of the chromosomal 

5 DNA of said actinomycete donor organism. 

4) A process as in claim 2 wherein the large fragments 
of the genomic DNA of step (a) are obtained by 
reconstruction through interplasmid homologous 
recombination from a set of pre-existing smaller segments 

10 of partially overlapping DNA cloned from the genome of the 
actinomycete donor organism, which set of segments 
encompass the entire gene cluster that directs the 
biosynthesis of said natural product. 

5) A process as in claim 2, 3 or 4 wherein the 
15 suitable vector of step (b) contains an int-attP region, 

where the int insert preferably derives from phage $C31. 

6) A process as in claim 5 wherein the suitable vector 
of step (b) is the plasmid pPAC-Sl or pPAC-S2 (Fig. 2) . 

7) A process as in claim 2 wherein the E. coli- 
20 Streptomyces Artificial Chromosome is the plasmid pPAC-Sl 

or pPAC-S2 according to claim 6 modified by insertion of 
the entire gene cluster that directs the biosynthesis of 
the natural product. 

8) A process as in claim 5 wherein the integration of 
25 the E. coli-Streptomyces Artificial Chromosome into the 

actinomycete host occurs at the attB site carried by said 
actinomycete host and is mediated by the int-attP function 
specified by the E. coli-Streptomyces Artificial Chromosome 

9) A process as in claim 2, 3, 4, 5, 6, 1 or 8 wherein 
30 the actinomycete host is a Streptomyces lividans strain. 

10) An actinomycete production host that is 
constructed from an actinomycete host by transfer of a 
cluster from a donor organism according to claim 1 or 2 . 
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11) An actinomycete production host as in claim 10 
that is a Streptomyces lividans strain. 

12) A modified strain obtained by genetic, 
physiological or technological manipulation of a production 

5 host of claim 10. 

13) A modified production host as in claim 12 that is 
a Streptomyces lividans strain. 

14) An E- coli-Streptomyces Artificial Chromosome that 
carries a gene cluster directing the biosynthesis of a 

10 natural product. 

15) An E. coli-Streptomyces Artificial Chromosome of 
claim 14 that contains an int-attP region and a selection 
marker . 

16) An E. coli-Streptomyces Artificial Chromosome of 
15 claim 15 that is the vector pPAC-Sl modified by insertion 

of a gene cluster directing the biosynthesis of a natural 
product . 

17) An E. coli-Streptomyces Artificial Chromosome of 
claim 15 that is the vector pPAC-S2 modified by insertion 

20 of a gene cluster directing the biosynthesis of a natural 
product . 

18) An E. coli-Streptomyces Artificial Chromosome as 
in claim 14 that is the construct PAD6 . 

19) An actinomycete production host as in claim 10 
25 that carries the construct PAD6. 

20) An actinomycete production host as in claim 19 
that is a Streptomyces lividans strain. 

21) A modified production host that is obtained by 
genetic, physiological or technological manipulation of the 

30 production host of claim 20. 

22) An E. coli-Streptomyces Artificial Chromosome as 
in claim 14 that carries a gene cluster from Planobispora 
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rosea 

23) An actinomycete production host as in claim 10 
that carries a gene cluster from Planobispora rosea. 

24) An actinomycete production host as in claim 10 
5 that contains the E. coli-Streptomyces Artificial 

Chromosome carrying the raparaycin gene cluster. 

25) An actinomycete production host as in claim 24 
that is a Streptomyces lividans strain. 

2 6) A modified production host obtained by genetic, 
10 physiological or technological manipulation of the 
production host of claim 24. 

27) A modified production host as in claim 26 that is 
a Streptomyces lividans strain. 

28) An E. coli-Streptomyces Artificial Chromosome as 
15 in claim 14 that carries the rapamycin gene cluster. 

29) An E. coli Streptomyces Artificial Chromosome as 
in claim 28 that is the vector pPAC-Sl or pPAC-S2 modified 
by insertion of the gene cluster directing the biosynthesis 
of rapamycin. 

20 30) An actinomycete production host as in claim 10 

that contains the E. coli-Streptomyces Artificial 
Chromosome carrying the erythromycin gene cluster. 

31) An actinomycete production host as in claim 30 
that is a Streptomyces lividans strain. 
25 32) A modified production host obtained by genetic, 

physiological or technological manipulation of the 
production host of claim 30. 

33) A modified production host as in claim 32 that is 
a Streptomyces lividans strain. 
30 34) An E. coli-Streptomyces Artificial Chromosome as 

in claim 14 that carries the erythromycin gene cluster. 

35) An E. coli-Streptomyces Artificial Chromosome as 
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in claim 34 that is the vector pPAC-Sl or pPAC-S2 modified 
by insertion of the gene cluster directing the biosynthesis 
of erythromycin. 

36) An actinomycete production host as in claim ID 
5 that contains the E. coli-Streptomyces Artificial 

Chromosome that carries the rifamycin gene cluster. 

37) An actinomycete production host as in claim 36 
that is a Streptojnyces lividans strain. 

38) A modified production host obtained by genetic, 
10 physiological or technological manipulation of the 

production host of claim 36. 

39) A modified production host as in claim 38 that is 
a Streptomyces lividans strain. 

40) An E. coli-Streptomyces Artificial Chromosome as 
15 in claim 14 that carries the rifamycin gene cluster. 

41) An E. coli-Streptomyces Artificial Chromosome as 
in claim 4 0 that is the vector pPAC-Sl or pPAC-S2 modified 
by insertion of the gene cluster that direct the 
biosynthesis of rifamycin. 

20 42) A process for the production of a natural product 

by cultivating an actinomycete strain capable of producing 
said natural product in the presence of nutrient medium, 
isolating and purifying said natural product, characterized 
in that the actinomycete strain capable of producing said 

25 natural product is a an actinomycete production host 
obtained according to the method of claim 1 or 2 . 

43) A process as in claim 42 wherein the actinomycete 
production host is a Streptomyces lividans or Streptojnyces 
coelicolor strain. 

30 44) A process as in claim 42 wherein the production 

host is one of those described in any of claims 23, 24 , 25, 
26, 27, 30,31, 32, 33, 36, 37, 38 or 39. 

45) A process as in claim 42, for the production of a 
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natural product selected from rapamycin, erythromycin and 
rif amycin. 
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SEQUENCE LISTING 
<110> Biosearch Italia Spa 

<120> Methods for transferring the capability to produce 
natural product into a suitable production host 

<130> G67773RS/mg 

<140> 
<141> 

<150> EP98111506.6 
<151> 1998-06-23 

<150> EP99107554.0 
<151> 1999-04-15 

<160> 38 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 44 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oligo probe 
<400> 1 

gcstacatca tccacacstc sggsacsacs ggsaagccsa aggg 



<210> 2 
<211> 44 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: ^.ol±go 
probe 

<400> 2 

ggstacatca tctacacsag cggsacsacs ggsaagccsa aggg 



<210> 3 

1 
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<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligo probe 
<400> 3 

akgctgtcsc csccsagsnn gaagaagtyg tcgtcgatsc c 



<210> 4 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligo probe 
<400> 4 

akggagtcsc csccsagsnn gaagaagtyg tcgtcgatsc c 



<210> 5 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 5 

tttttggtac ctgacgtccc gaaggcgtg 



<210> 6 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 6 

cagcttgtcc atggcgga 



<210> 7 



SUBSTITUTE SHEET {Rule 26) 
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<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 7 

tctgtccgcc atggacaagc 



<210> 8 
<211>32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 8 

tttttggatc cggctaacta actaaaccga ga 



<210> 9 
<211> 200 
<212> DNA 

<213> Planobispora rosea 
<400> 9 

ggatcccgag caccgaccag ccgtgggcgg 
ccgacgactc cagcacggcc aggcccgcgg 
ccacggttga gcagggtgag cagtgcccgg 
gcggcggccc ggttgctcag 



ggacgagaca cgggtctccc ggagcctccc 60 
cctcgaccgg gaagcggtag ggcctgtcgt 120 
ccgggatggt ccgggtcagc cgaggccagc 180 

200 



<210> 10 
<211> 200 
<212> DNA 

<213> Planobispora rosea 
<400> 10 

ggatcccgag caccgaccag ccgtgggcgg 
ccgacgactc cagcacggcc aggcccgcgg 
ccacggttga gcagggtgag cagtgcccgg 
gcggcggccc ggttgctcag 



ggacgagaca cgggtctccc ggagcctccc 60 
cctcgaccgg gaagcggtag ggcctgtcgt 120 
ccgggatggt ccgggtcagc cgaggccagc 180 

200 



<210> 11 



SUBSTITUTE SHEET (Rule 26) 



WO 99/67374 



<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220* 

<223> Description of Artificial Sequence: linker 
<400> 11 

gatctaagct tggggg 

<210> 12 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: linker 

<400> 12 
cccccaagct ta 

<210> 13 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 13 

gagctctcat gtttgacagc t 

<210> 14 
<211> 24 
<212> DttA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 14 

gagctctgac ttccgcgttt ccag 
<210> 15 

4 



SUBSTITUTE SHEET (Rule 26) 



WO 99/67374 



PCT/EP99/04079 



<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR primer 
<400> 15 

tttttgaatt cggtaccagc cgacggcga 



<210> 16 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description, of Artificial Sequence: PCR primer 
<400> 16 

tttttggatc cctgttccac cagcgcacc 29 



<210> 17 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 17 

ttttttctag accgtcgtcg gtggttctc 29 



<210> 18 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial; Sequence: PCR primer 
<400> 18 

29 

tttttggatc caggaagccc tgtgctgtc 



<210> 19 
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<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 19 

ttttttgtag aggtcaagat ccggggcat 



<210> 20 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 20 

tttttctgca ggacagcgcc cttgaggtg 



<210> 21 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 21 

tttttctgca ggcgacgaag aggggc 



<210> 22 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 22 

tttttaagct tagcgcgacc ggggcggt 



<210> 23 

6 

SUBSTITUTE SHEET (Rule 26) 



WO 99/67374 
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<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 23 

catgggaatt cggggg 

<210> 24 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 

<400> 24 
cccccgaatt cc 

<210> 25 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 25 

tttttggatc cggggcagcg gttggttcc 



<210> 26 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 26 

ttttttctag aaggcagctc cagatgatc 



<210> 27 

7 
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<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 27 

tttttctaga ccggactcgg ccggctcg 



<210> 28 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 28 

tttttctgca gccgcacgcc tcggtggtc 



<210> 29 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 29 

tttttctgca gggaccctga gtgcaggtg 



<210> 30 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 30 

tttttaagct tcagtagccg tcgctgagc 



<210> 31 

8 
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<211> 29 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 31 

tttttgaatt ctgcagaccg ccgaggaag 



<210> 32 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 32 

tttttggatc cggagtcgta gctgacgac 



<210> 33 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 33 

ttttggatcc cgaccacgcg gggacgtc 



<210> 34 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 34 

ttttttctag accagggaac ccgtgctgc 



<210> 35 



9 
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<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 35 

ttttttctag acggaagctc gccgcgatc 



<210> 36 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 36 

tttttctgca ggtccgtagc ccggacacc 



<210> 37 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 37 

tttttctgca gttcgggcga cagttcctt 



<210> 38 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
<400> 38 

tttttaagct tcaacaagcc atccgggtc 
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